Transgenic non-human animals capable of producing heterologous antibodies



TECHNICAL FIELD 
The invention relates to transgenic non-human
animals capable of producing heterologous antibodies,
transgenes used to produce such transgenic animals,
transgenes capable of functionally rearranging a heterologous
D gene in V-D-J recombination, immortalized B-cells capable of
producing heterologous antibodies, methods and transgenes for
producing heterologous antibodies of multiple isotypes,
methods and transgenes for inactivating or suppressing
expression of endogenous immunoglobulin loci, methods and
transgenes for producing heterologous antibodies wherein a
variable region sequence comprises somatic mutation as
compared to germline rearranged variable region sequences, and
transgenic nonhuman animals which produce antibodies having a
human primary sequence and which bind to human antigens.




BACKGROUND OF THE INVENTION 
One of the major impediments facing the development
of in vivo therapeutic and diagnostic applications for
monoclonal antibodies in humans is the intrinsic
immunogenicity of non-human immunoglobulins. For example, when
immunocompetent human patients are administered therapeutic
doses of rodent monoclonal antibodies, the patients produce
antibodies against the rodent immunoglobulin sequences; these
human anti-mouse antibodies (HAMA) neutralize the therapeutic
antibodies and can cause acute toxicity. Hence, it is
desirable to produce human immunoglobulins that are reactive
with specific human antigens that are promising therapeutic
and/or diagnostic targets. However, producing human
immunoglobulins that bind specifically with human antigens is
problematic.

WO 93/12227 PCT/US92/10983

The present technology for generating monoclonal
antibodies involves pre-exposing, or priming, an animal
(usually a rat or mouse) with antigen, harvesting B-cells from
that animal, and generating a library of hybridoma clones. By
screening a hybridoma population for antigen binding
specificity (idiotype) and also screening for immunoglobulin
class (isotype), it is possible to select hybridoma clones
that secrete the desired antibody.

However, when present methods for generating
monoclonal antibodies are applied for the purpose of
generating human antibodies that have binding specificities
for human antigens, obtaining B-lymphocytes which produce
human immunoglobulins is a serious obstacle, since humans will
typically not make immune responses against self-antigens.

Hence, present methods of generating human
monoclonal antibodies that are specifically reactive with
human antigens are clearly insufficient. It is evident that
the same limitations on generating monoclonal antibodies to
authentic self antigens apply where non-human species are used
as the source of B-cells for making the hybridoma.

The construction of transgenic animals harboring a
functional heterologous immunoglobulin transgene is a method
by which antibodies reactive with self antigens may be
produced. However, in order to obtain expression of
therapeutically useful antibodies, or hybridoma clones
producing such antibodies, the transgenic animal must produce
transgenic B cells that are capable of maturing through the B
lymphocyte development pathway. Such maturation requires the
presence of surface IgM on the transgenic B cells; however,
isotypes other than IgM are desired for therapeutic uses.
Thus, there is a need for transgenes and animals harboring
such transgenes that are able to undergo functional V-D-J
rearrangement to generate recombinational diversity and
junctional diversity. Further, such transgenes and transgenic
animals preferably include cis-acting sequences that
facilitate isotype switching from a first isotype that is
required for B cell maturation to a subsequent isotype that
has superior therapeutic utility.

A number of experiments have reported the use of
transfected cell lines to determine the specific DNA sequences
required for Ig gene rearrangement (reviewed by Lewis and
Gellert (1989), Cell, 59, 585-588). Such reports have
identified putative sequences and concluded that the
accessibility of these sequences to the recombinase enzymes
used for rearrangement is modulated by transcription
(Yancopoulos and Alt (1985), Cell, 40, 271-281). The
sequences for V(D)J joining are reportedly a highly conserved,
near-palindromic heptamer and a less well conserved AT-rich
nonamer separated by a spacer of either 12 or 23 bp (Tonegawa
(1983), Nature, 302, 575-581; Hesse, et al. (1989), Genes
Dev., 3, 1053-1061). Efficient recombination reportedly
occurs only between sites containing recombination signal
sequences with different length spacer regions.
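
The spacer-length constraint described above (often called the 12/23 rule) can be illustrated with a minimal Python sketch. The class and function names are hypothetical, and the heptamer and nonamer strings are the commonly cited consensus sequences, used here only as placeholders. The sketch checks spacer lengths alone; it does not model sequence conservation or recombinase accessibility.

```python
from dataclasses import dataclass

ALLOWED_SPACERS = (12, 23)  # spacer lengths (bp) named in the text


@dataclass(frozen=True)
class RecombinationSignal:
    """Hypothetical model of a recombination signal sequence (RSS)."""
    heptamer: str    # conserved, near-palindromic 7-mer
    spacer_bp: int   # spacer length in base pairs
    nonamer: str     # less well conserved, AT-rich 9-mer

    def __post_init__(self):
        if len(self.heptamer) != 7 or len(self.nonamer) != 9:
            raise ValueError("heptamer must be 7 nt and nonamer 9 nt")
        if self.spacer_bp not in ALLOWED_SPACERS:
            raise ValueError("spacer must be 12 or 23 bp")


def can_recombine(a, b):
    """Return True only when the two signals carry different spacer lengths."""
    return a.spacer_bp != b.spacer_bp


# Illustrative consensus sequences (an assumption, not taken from the text).
rss12 = RecombinationSignal("CACAGTG", 12, "ACAAAAACC")
rss23 = RecombinationSignal("CACAGTG", 23, "ACAAAAACC")

print(can_recombine(rss12, rss23))  # a 12 bp spacer paired with a 23 bp spacer
print(can_recombine(rss12, rss12))  # two 12 bp spacers
```

Under this simplified model, a segment flanked by a 12 bp spacer signal can join only a segment flanked by a 23 bp spacer signal, which is one way to picture how the order of joining is constrained.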

Ig gene rearrangement, though studied in tissue
culture cells, has not been extensively examined in transgenic
mice. Only a handful of reports have been published
describing rearrangement test constructs introduced into mice
[Buchini, et al. (1987), Nature, 326, 409-411 (unrearranged
chicken λ transgene); Goodhart, et al. (1987), Proc. Natl.
Acad. Sci. USA, 84, 4229-4233 (unrearranged rabbit κ gene);
and Bruggemann, et al. (1989), Proc. Natl. Acad. Sci. USA, 86,
6709-6713 (hybrid mouse-human heavy chain)]. The results of
such experiments, however, have been variable, in some cases
producing incomplete or minimal rearrangement of the
transgene.

Further, a variety of biological functions of
antibody molecules are exerted by the Fc portion of molecules,
such as the interaction with mast cells or basophils through
Fcε, and binding of complement by Fcμ or Fcγ. It is therefore
further desirable to generate a functional diversity of
antibodies of a given specificity by variation of isotype.

Although transgenic animals have been generated that
incorporate transgenes encoding one or more chains of a
heterologous antibody, there have been no reports of
heterologous transgenes that undergo successful isotype
switching. Transgenic animals that cannot switch isotypes are
limited to producing heterologous antibodies of a single
isotype, and more specifically are limited to producing an
isotype that is essential for B cell maturation, such as IgM
and possibly IgD, which may be of limited therapeutic utility.
Thus, there is a need for heterologous immunoglobulin
transgenes and transgenic animals that are capable of
switching from an isotype needed for B cell development to an
isotype that has a desired characteristic for therapeutic use.

Based on the foregoing, it is clear that a need
exists for methods of efficiently producing heterologous
antibodies, e.g., antibodies encoded by genetic sequences of a
first species that are produced in a second species. More
particularly, there is a need in the art for heterologous
immunoglobulin transgenes and transgenic animals that are
capable of undergoing functional V-D-J gene rearrangement that
incorporates all or a portion of a D gene segment which
contributes to recombinational diversity. Further, there is a
need in the art for transgenes and transgenic animals that can
support V-D-J recombination and isotype switching so that (1)
functional B cell development may occur, and (2)
therapeutically useful heterologous antibodies may be
produced. There is also a need for a source of B cells which
can be used to make hybridomas that produce monoclonal
antibodies for therapeutic or diagnostic use in the particular
species for which they are designed. A heterologous
immunoglobulin transgene capable of functional V-D-J
recombination and/or capable of isotype switching could
fulfill these needs.

In accordance with the foregoing object, transgenic
nonhuman animals are provided which are capable of producing a
heterologous antibody, such as a human antibody.

Further, it is an object to provide B-cells from
such transgenic animals which are capable of expressing
heterologous antibodies wherein such B-cells are immortalized
to provide a source of a monoclonal antibody specific for a
particular antigen.



In accordance with this foregoing object, it is a
further object of the invention to provide hybridoma cells
that are capable of producing such heterologous monoclonal
antibodies.

Still further, it is an object herein to provide 
heterologous unrearranged and rearranged immunoglobulin heavy 
and light chain transgenes useful for producing the 
aforementioned non-human transgenic animals. 

Still further, it is an object herein to provide 
methods to disrupt endogenous immunoglobulin loci in the 
transgenic animals. 

Still further, it is an object herein to provide 
methods to induce heterologous antibody production in the 
aforementioned transgenic non-human animal. 

A further object of the invention is to provide 
methods to generate an immunoglobulin variable region gene 
segment repertoire that is used to construct one or more 
transgenes of the invention. 

The references discussed herein are provided solely 
for their disclosure prior to the filing date of the present 
application. Nothing herein is to be construed as an 
admission that the inventors are not entitled to antedate such 
disclosure by virtue of prior invention. 



SUMMARY OF THE INVENTION 
Transgenic nonhuman animals are provided which are
capable of producing a heterologous antibody, such as a human
antibody. Such heterologous antibodies may be of various
isotypes, including: IgG1, IgG2, IgG3, IgG4, IgM, IgA1, IgA2,
IgAsec, IgD, or IgE. In order for such transgenic nonhuman
animals to make an immune response, it is necessary for the
transgenic B cells and pre-B cells to produce surface-bound
immunoglobulin, particularly of the IgM (or possibly IgD)
isotype, in order to effectuate B cell development and
antigen-stimulated maturation. Such expression of an IgM (or
IgD) surface-bound immunoglobulin is only required during the
antigen-stimulated maturation phase of B cell development, and
mature B cells may produce other isotypes, although only a
single switched isotype may be produced at a time.

Typically, a cell of the B-cell lineage will produce
only a single isotype at a time, although cis or trans
alternative RNA splicing, such as occurs naturally with the μS
(secreted μ) and μM (membrane-bound μ) forms, and the μ and δ
immunoglobulin chains, may lead to the contemporaneous
expression of multiple isotypes by a single cell. Therefore,
in order to produce heterologous antibodies of multiple
isotypes, specifically the therapeutically useful IgG, IgA,
and IgE isotypes, it is necessary that isotype switching
occur. Such isotype switching may be classical
class-switching or may result from one or more non-classical
isotype switching mechanisms.

The invention provides heterologous immunoglobulin 
transgenes and transgenic nonhuman animals harboring such 
transgenes, wherein the transgenic animal is capable of 
producing heterologous antibodies of multiple isotypes by 
undergoing isotype switching. Classical isotype switching
occurs by recombination events which involve at least one
switch sequence region in the transgene. Non-classical
isotype switching may occur by, for example, homologous
recombination between human σμ and human Σμ sequences
(δ-associated deletion). Alternative non-classical switching
mechanisms, such as intertransgene and/or interchromosomal
recombination, among others, may occur and effectuate isotype
switching. Such transgenes and transgenic nonhuman animals
produce a first immunoglobulin isotype that is necessary for
antigen-stimulated B cell maturation and can switch to encode
and produce one or more subsequent heterologous isotypes that
have therapeutic and/or diagnostic utility. Transgenic
nonhuman animals of the invention are thus able to produce, in 
one embodiment, IgG, IgA, and/or IgE antibodies that are 
encoded by human immunoglobulin genetic sequences and which 
also bind specific human antigens with high affinity. 

The invention also encompasses B-cells from such
transgenic animals that are capable of expressing heterologous
antibodies of various isotypes, wherein such B-cells are
immortalized to provide a source of a monoclonal antibody
specific for a particular antigen. Hybridoma cells that are
derived from such B-cells can serve as one source of such
heterologous monoclonal antibodies.

The invention provides heterologous unrearranged and
rearranged immunoglobulin heavy and light chain transgenes
capable of undergoing isotype switching in vivo in the
aforementioned non-human transgenic animals or in explanted
lymphocytes of the B-cell lineage from such transgenic
animals. Such isotype switching may occur spontaneously or be
induced by treatment of the transgenic animal or explanted
B-lineage lymphocytes with agents that promote isotype
switching, such as T-cell-derived lymphokines (e.g., IL-4 and
IFNγ).

Still further, the invention includes methods to
induce heterologous antibody production in the aforementioned
transgenic non-human animal, wherein such antibodies may be of
various isotypes. These methods include producing an
antigen-stimulated immune response in a transgenic nonhuman
animal for the generation of heterologous antibodies,
particularly heterologous antibodies of a switched isotype
(i.e., IgG, IgA, and IgE).

This invention provides methods whereby the
transgene contains sequences that effectuate isotype
switching, so that the heterologous immunoglobulins produced
in the transgenic animal and monoclonal antibody clones
derived from the B-cells of said animal may be of various
isotypes.

This invention further provides methods that
facilitate isotype switching of the transgene, so that
switching between particular isotypes may occur at much higher
or lower frequencies or in different temporal orders than
typically occurs in germline immunoglobulin loci. Switch
regions may be grafted from various CH genes and ligated to
other CH genes in a transgene construct; such grafted switch
sequences will typically function independently of the
associated CH gene so that switching in the transgene
construct will typically be a function of the origin of the
associated switch regions. Alternatively, or in combination
with switch sequences, δ-associated deletion sequences may be
linked to various CH genes to effect non-classical switching
by deletion of sequences between two δ-associated deletion
sequences. Thus, a transgene may be constructed so that a
particular CH gene is linked to a different switch sequence
and thereby is switched to more frequently than occurs when
the naturally associated switch region is used.

This invention also provides methods to determine 
whether isotype switching of transgene sequences has occurred 
in a transgenic animal containing an immunoglobulin transgene. 

The invention provides immunoglobulin transgene
constructs and methods for producing immunoglobulin transgene
constructs, some of which contain a subset of germline
immunoglobulin loci sequences (which may include deletions).
The invention includes a specific method for facilitated
cloning and construction of immunoglobulin transgenes,
involving a vector that employs unique XhoI and SalI
restriction sites flanked by two unique NotI sites. This
method exploits the complementary termini of XhoI and SalI
restriction sites and is useful for creating large constructs
by ordered concatemerization of restriction fragments in a
vector.
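
Why complementary XhoI and SalI termini lend themselves to ordered assembly can be sketched in Python. The recognition sequences used here (XhoI, C^TCGAG; SalI, G^TCGAC) are standard properties of these enzymes rather than details stated in this text, and the helper names are hypothetical. The sketch shows that both enzymes leave the same four-base overhang, and that joining a XhoI end to a SalI end yields a hybrid junction that matches neither recognition site, which is presumably what lets fragments be added one at a time without recreating a cleavable site.

```python
XHOI_SITE = "CTCGAG"  # XhoI cuts after the first base: C^TCGAG
SALI_SITE = "GTCGAC"  # SalI cuts after the first base: G^TCGAC
CUT_OFFSET = 1        # both enzymes cut one base into the 6 bp site


def overhang(site, cut_offset=CUT_OFFSET):
    """Return the 5' overhang left by cutting a palindromic site."""
    return site[cut_offset:len(site) - cut_offset]


def ligate(left_site, right_site, cut_offset=CUT_OFFSET):
    """Return the top-strand junction formed by joining the left half
    of one cut site to the right half of another."""
    if overhang(left_site, cut_offset) != overhang(right_site, cut_offset):
        raise ValueError("overhangs are not complementary")
    return (left_site[:cut_offset]
            + overhang(left_site, cut_offset)
            + right_site[len(right_site) - cut_offset:])


hybrid = ligate(XHOI_SITE, SALI_SITE)
print(overhang(XHOI_SITE), overhang(SALI_SITE))
print(hybrid, hybrid in (XHOI_SITE, SALI_SITE))
```

In this simplified picture, a XhoI-to-XhoI or SalI-to-SalI ligation regenerates the original site, whereas the mixed junction does not.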

The transgenes of the invention include a heavy
chain transgene comprising DNA encoding at least one variable
gene segment, one diversity gene segment, one joining gene
segment and one constant region gene segment. The
immunoglobulin light chain transgene comprises DNA encoding at
least one variable gene segment, one joining gene segment and
one constant region gene segment. The gene segments encoding
the light and heavy chain gene segments are heterologous to
the transgenic non-human animal in that they are derived from,
or correspond to, DNA encoding immunoglobulin heavy and light
chain gene segments from a species not consisting of the
transgenic non-human animal. In one aspect of the invention,
the transgene is constructed such that the individual gene
segments are unrearranged, i.e., not rearranged so as to
encode a functional immunoglobulin light or heavy chain. Such
unrearranged transgenes permit recombination of the gene
segments (functional rearrangement) and expression of the
resultant rearranged immunoglobulin heavy and/or light chains
within the transgenic non-human animal when said animal is
exposed to antigen.
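
The minimum segment content stated above for heavy and light chain transgenes can be expressed as a small Python check. This is only an illustrative sketch: the function name, the (kind, name) representation, and the segment labels in the example are hypothetical and do not correspond to any construct named in this document.

```python
# Minimum gene segment kinds per chain, as stated in the text:
# heavy chain needs V, D, J and C; light chain needs V, J and C.
REQUIRED_SEGMENTS = {
    "heavy": {"V", "D", "J", "C"},
    "light": {"V", "J", "C"},
}


def missing_segments(chain, segments):
    """Return the required segment kinds absent from a transgene.

    `segments` is an iterable of (kind, name) pairs, where kind is
    one of "V", "D", "J" or "C".
    """
    if chain not in REQUIRED_SEGMENTS:
        raise ValueError("chain must be 'heavy' or 'light'")
    present = {kind for kind, _name in segments}
    return REQUIRED_SEGMENTS[chain] - present


# Hypothetical segment labels, for illustration only.
example_heavy = [("V", "V-a"), ("D", "D-a"), ("J", "J-a"), ("C", "C-a")]
example_light = [("V", "V-b"), ("C", "C-b")]

print(missing_segments("heavy", example_heavy))
print(missing_segments("light", example_light))
```

An empty result means the construct carries at least one segment of every required kind; the check says nothing about whether the segments are functional or correctly ordered.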

In one aspect of the invention, heterologous heavy
and light immunoglobulin transgenes comprise relatively large
fragments of unrearranged heterologous DNA. Such fragments
typically comprise a substantial portion of the C, J (and in
the case of heavy chain, D) segments from a heterologous
immunoglobulin locus. In addition, such fragments also
comprise a substantial portion of the variable gene segments.

In one embodiment, such transgene constructs
comprise regulatory sequences, e.g., promoters, enhancers,
class switch regions, recombination signals and the like,
corresponding to sequences derived from the heterologous DNA.
Alternatively, such regulatory sequences may be incorporated
into the transgene from the same or a related species of the
non-human animal used in the invention. For example, human
immunoglobulin gene segments may be combined in a transgene
with a rodent immunoglobulin enhancer sequence for use in a
transgenic mouse.

In a method of the invention, a transgenic non-human
animal containing germline unrearranged light and heavy
immunoglobulin transgenes, which undergo VDJ joining during
B-cell differentiation, is contacted with an antigen to
induce production of a heterologous antibody in a secondary
repertoire B-cell.

Also included in the invention are vectors and
methods to disrupt the endogenous immunoglobulin loci in the
non-human animal to be used in the invention. Such vectors
and methods utilize a transgene, preferably a positive-negative
selection vector, which is constructed such that it targets
the functional disruption of a class of gene segments encoding
a heavy and/or light immunoglobulin chain endogenous to the
non-human animal used in the invention. Such endogenous gene
segments include diversity, joining and constant region gene
segments. In this aspect of the invention, the
positive-negative selection vector is contacted with at least
one embryonic stem cell of a non-human animal, after which
cells are selected wherein the positive-negative selection
vector has integrated into the genome of the non-human animal
by way of homologous recombination. After transplantation,
the resultant transgenic non-human animal is substantially
incapable of mounting an immunoglobulin-mediated immune
response as a result of homologous integration of the vector
into chromosomal DNA. Such immune deficient non-human animals
may thereafter be used for study of immune deficiencies or
used as the recipient of heterologous immunoglobulin heavy and
light chain transgenes.

The invention also provides vectors, methods, and
compositions useful for suppressing the expression of one or
more species of immunoglobulin chain(s), without disrupting an
endogenous immunoglobulin locus. Such methods are useful for
suppressing expression of one or more endogenous
immunoglobulin chains while permitting the expression of one
or more transgene-encoded immunoglobulin chains. Unlike
genetic disruption of an endogenous immunoglobulin chain
locus, suppression of immunoglobulin chain expression does not
require the time-consuming breeding that is needed to
establish transgenic animals homozygous for a disrupted
endogenous Ig locus. An additional advantage of suppression
as compared to endogenous Ig gene disruption is that, in
certain embodiments, chain suppression is reversible within an
individual animal. For example, Ig chain suppression may be
accomplished with: (1) transgenes encoding and expressing
antisense RNA that specifically hybridizes to an endogenous Ig
chain gene sequence, (2) antisense oligonucleotides that
specifically hybridize to an endogenous Ig chain gene
sequence, and (3) immunoglobulins that bind specifically to an
endogenous Ig chain polypeptide.
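
The base-pairing relationship behind antisense suppression can be shown with a short Python sketch: an antisense strand is the reverse complement of the sense-strand target it hybridizes to. The function names are hypothetical and the six-base target is an arbitrary placeholder, not a sequence from any immunoglobulin gene.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence, read 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """Return the antisense RNA, read 5' to 3', for a sense-strand DNA target."""
    return reverse_complement(sense_dna).upper().replace("T", "U")


target = "ATGGCC"  # arbitrary placeholder target (assumption)
print(reverse_complement(target))  # antisense DNA oligonucleotide
print(antisense_rna(target))       # antisense RNA transcript
```

Real antisense design also has to consider target accessibility, length, and specificity, none of which this sketch addresses.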

BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 depicts the complementarity determining
regions CDR1, CDR2 and CDR3 and framework regions FR1, FR2,
FR3 and FR4 in unrearranged genomic DNA and mRNA expressed
from a rearranged immunoglobulin heavy chain gene.

Fig. 2 depicts the human λ chain locus.

Fig. 3 depicts the human κ chain locus.

Fig. 4 depicts the human heavy chain locus.

Fig. 5 depicts a transgene construct containing a
rearranged IgM gene ligated to a 25 kb fragment that contains
human γ3 and γ1 constant regions followed by a 700 bp fragment
containing the rat chain 3' enhancer sequence.

Fig. 6 is a restriction map of the human κ chain
locus depicting the fragments to be used to form a light chain
transgene by way of in vivo homologous recombination.

Fig. 7 depicts the construction of pGP1.

Fig. 8 depicts the construction of the polylinker
contained in pGP1.

Fig. 9 depicts the fragments used to construct a
human heavy chain transgene of the invention.

Fig. 10 depicts the construction of pHIG1 and pCON1.

Fig. 11 depicts the human Cγ1 fragments which are
inserted into pRE3 (rat enhancer 3') to form pREG2.

Fig. 12 depicts the construction of pHIG3' and PCON.

Fig. 13 depicts the fragment containing human D
region segments used in construction of the transgenes of the
invention.

Fig. 14 depicts the construction of pHIG2 (D segment
containing plasmid).

Fig. 15 depicts the fragments covering the human Jκ
and human Cκ gene segments used in constructing a transgene of
the invention.

Fig. 16 depicts the structure of pEμ.

Fig. 17 depicts the construction of pKapH.

Figs. 18A through 18D depict the construction of a
positive-negative selection vector for functionally disrupting
the endogenous heavy chain immunoglobulin locus of mouse.
Figs. 19A through 19C depict the construction of a
positive-negative selection vector for functionally disrupting
the endogenous immunoglobulin light chain loci in mouse.

Figs. 20 a through e depict the structure of a kappa
light chain targeting vector.

Figs. 21 a through f depict the structure of a mouse
heavy chain targeting vector.

Fig. 22 depicts the map of vector pGPe.

Fig. 23 depicts the structure of vector pJM2.

Fig. 24 depicts the structure of vector pCOR1.

Fig. 25 depicts the transgene constructs for pIGM1,
pHC1 and pHC2.

Fig. 26 depicts the structure of pγe2.

Fig. 27 depicts the structure of pVGE1.

Fig. 28 depicts the assay results of human Ig
expression in a pHC1 transgenic mouse.

Fig. 29 depicts the structure of pJCK1.

Fig. 30 depicts the construction of a synthetic
heavy chain variable region.

Fig. 31 is a schematic representation of the heavy
chain minilocus constructs pIGM1, pHC1, and pHC2.

Fig. 32 is a schematic representation of the heavy
chain minilocus construct pIGG1 and the κ light chain
minilocus constructs pKC1, pKVe1, and pKC2.

Fig. 33 depicts a scheme to reconstruct functionally
rearranged light chain genes.

Fig. 34 depicts serum ELISA results.

Fig. 35 depicts the results of an ELISA assay of
serum from 8 transgenic mice.

Fig. 36 is a schematic representation of plasmid
pBCE1.

Fig. 37 depicts the immune response of transgenic
mice of the present invention against KLH-DNP, by measuring
IgG and IgM levels specific for KLH-DNP (37A), KLH (37B) and
BSA-DNP (37C).

Fig. 38 shows ELISA data demonstrating the presence
of antibodies that bind human carcinoembryonic antigen (CEA)
and comprise human μ chains; each panel shows reciprocal
serial dilutions from pooled serum samples obtained from mice
on the indicated day following immunization.

Fig. 39 shows ELISA data demonstrating the presence
of antibodies that bind human carcinoembryonic antigen (CEA)
and comprise human γ chains; each panel shows reciprocal
serial dilutions from pooled serum samples obtained from mice
on the indicated day following immunization.

Fig. 40 shows aligned variable region sequences of
23 randomly-chosen cDNAs generated from mRNA obtained from
lymphoid tissue of HC1 transgenic mice immunized with human
carcinoembryonic antigen (CEA) as compared to the germline
transgene sequence (top line); on each line nucleotide changes
relative to germline sequence are shown above the alteration
in deduced amino acid sequence (if any); the regions
corresponding to heavy chain CDR1, CDR2, and CDR3 are
indicated. Non-germline encoded nucleotides are shown in
capital letters. Germline VH251 and JH are shown in lower case
letters. Deduced amino acid changes are given beneath
nucleotide sequences using the conventional single-letter
notation.

Fig. 41 shows the data from Fig. 40 in histogram
format; deduced amino acid residue position is shown as the
ordinate (left is the amino-terminal direction, right is in
the direction towards the carboxy-terminus) and frequency of
sequence variation is shown as the abscissa.

Fig. 42 shows the nucleotide sequence of a human DNA
fragment, designated vk65.3, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 43 shows the nucleotide sequence of a human DNA
fragment, designated vk65.5, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 44 shows the nucleotide sequence of a human DNA
fragment, designated vk65.8, containing a Vκ gene segment; the
deduced amino acid sequences of the Vκ coding regions are also
shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 45 shows the nucleotide sequence of a human DNA
fragment, designated vk65.15, containing a Vκ gene segment;
the deduced amino acid sequences of the Vκ coding regions are
also shown; splicing and recombination signal sequences
(heptamer/nonamer) are shown boxed.

Fig. 46 shows formation of a light chain minilocus
by homologous recombination between two overlapping fragments
which were co-injected.

Table 1 depicts the sequence of vector pGPe.

Table 2 depicts the sequence of gene VH49.8.

Table 3 depicts the detection of human IgM and IgG
in the serum of transgenic mice of this invention.

Table 4 depicts sequences of VDJ joints.

Table 5 depicts the distribution of J segments
incorporated into pHC1 transgene encoded transcripts as
compared to J segments found in adult human peripheral blood
lymphocytes (PBL).

Table 6 depicts the distribution of D segments
incorporated into pHC1 transgene encoded transcripts as
compared to D segments found in adult human peripheral blood
lymphocytes (PBL).

Table 7 depicts the length of the CDR3 peptides from
transcripts with in-frame VDJ joints in the pHC1 transgenic
mouse and in human PBL.

Table 8 depicts the predicted amino acid sequences
of the VDJ regions from 30 clones analyzed from a pHC1
transgenic.

Table 9 shows transgenic mice of line 112 that were
used in the indicated experiments; (+) indicates the presence
of the respective transgene, (++) indicates that the animal is
homozygous for the JHD knockout transgene.

DETAILED DESCRIPTION 
As has been discussed supra, it is desirable to
produce human immunoglobulins that are reactive with specific
human antigens that are promising therapeutic and/or
diagnostic targets. However, producing human immunoglobulins
that bind specifically with human antigens is problematic.

First, the immunized animal that serves as the
source of B cells must make an immune response against the
presented antigen. In order for an animal to make an immune
response, the antigen presented must be foreign and the animal
must not be tolerant to the antigen. Thus, for example, if it
is desired to produce a human monoclonal antibody with an
idiotype that binds to a human protein, self-tolerance will
prevent an immunized human from making a substantial immune
response to the human protein, since the only epitopes of the
antigen that may be immunogenic will be those that result from
polymorphism of the protein within the human population
(allogeneic epitopes).

Second, if the animal that serves as the source of
B-cells for forming a hybridoma (a human in the illustrative
example given) does make an immune response against an
authentic self antigen, a severe autoimmune disease may result
in the animal. Where humans would be used as a source of
B-cells for a hybridoma, such autoimmunization would be
considered unethical by contemporary standards.

One methodology that can be used to obtain human
antibodies that are specifically reactive with human antigens
is the production of a transgenic mouse harboring the human
immunoglobulin transgene constructs of this invention.
Briefly, transgenes containing all or portions of the human
immunoglobulin heavy and light chain loci, or transgenes
containing synthetic "miniloci" (described infra, and in
PCT/US91/06185 filed August 28, 1991) which comprise essential
functional elements of the human heavy and light chain loci,
are employed to produce a transgenic nonhuman animal. Such a
transgenic nonhuman animal will have the capacity to produce
immunoglobulin chains that are encoded by human immunoglobulin
genes, and additionally will be capable of making an immune
response against human antigens. Thus, such transgenic animals
can serve as a source of immune sera reactive with specified
human antigens, and B-cells from such transgenic animals can
be fused with myeloma cells to produce hybridomas that secrete
monoclonal antibodies that are encoded by human immunoglobulin
genes and which are specifically reactive with human antigens.

The production of transgenic mice containing various 
forms of immunoglobulin genes has been reported previously. 
Rearranged mouse immunoglobulin heavy or light chain genes 
have been used to produce transgenic mice. In addition, 
functionally rearranged human Ig genes including the μ or γ1
constant region have been expressed in transgenic mice.
However, results of experiments in which the transgene
comprises unrearranged (V-D-J or V-J not rearranged)
immunoglobulin genes have been variable, in some cases
producing incomplete or minimal rearrangement of the
transgene. Moreover, there are no published examples of either
rearranged or unrearranged immunoglobulin transgenes which
undergo successful isotype switching between genes within a
transgene.



Definitions 

As used herein, the term "antibody" refers to a 
glycoprotein comprising at least two light polypeptide chains 
and two heavy polypeptide chains. Each of the heavy and light 
polypeptide chains contains a variable region (generally the 
amino terminal portion of the polypeptide chain) which 
contains a binding domain which interacts with antigen. Each 
of the heavy and light polypeptide chains also comprises a 
constant region of the polypeptide chains (generally the 
carboxyl terminal portion) which may mediate the binding of 
the immunoglobulin to host tissues or factors including 
various cells of the immune system, some phagocytic cells and 
the first component (C1q) of the classical complement system.

As used herein, a "heterologous antibody" is defined 
in relation to the transgenic non-human organism producing 
such an antibody. It is defined as an antibody having an amino
acid sequence or an encoding DNA sequence corresponding to
that found in an organism not consisting of the transgenic
non-human animal.

As used herein, a "heterohybrid antibody" refers to
an antibody having light and heavy chains of different
organismal origins. For example, an antibody having a human
heavy chain associated with a murine light chain is a
heterohybrid antibody.

As used herein, "isotype" refers to the antibody 
class (e.g., IgM or IgG1) that is encoded by heavy chain
constant region genes. 

As used herein, "isotype switching" refers to the 
phenomenon by which the class, or isotype, of an antibody 
changes from one Ig class to one of the other Ig classes.

As used herein, "nonswitched isotype" refers to the
isotypic class of heavy chain that is produced when no isotype
switching has taken place; the C_H gene encoding the
nonswitched isotype is typically the first C_H gene immediately
downstream from the functionally rearranged VDJ gene.

As used herein, the term "switch sequence" refers to
those DNA sequences responsible for switch recombination. A
"switch donor" sequence, typically a μ switch region, will be
5' (i.e., upstream) of the construct region to be deleted
during the switch recombination. The "switch acceptor" region
will be between the construct region to be deleted and the
replacement constant region (e.g., γ, ε, etc.). As there is
no specific site where recombination always occurs, the final
gene sequence will typically not be predictable from the
construct.

As used herein, "glycosylation pattern" is defined
as the pattern of carbohydrate units that are covalently
attached to a protein, more specifically to an immunoglobulin
protein. A glycosylation pattern of a heterologous antibody
can be characterized as being substantially similar to
glycosylation patterns which occur naturally on antibodies
produced by the species of the nonhuman transgenic animal,
when one of ordinary skill in the art would recognize the
glycosylation pattern of the heterologous antibody as being
more similar to said pattern of glycosylation in the species
of the nonhuman transgenic animal than to the species from
which the C_H genes of the transgene were derived.

As used herein, "specific binding" refers to the
property of the antibody: (1) to bind to a predetermined
antigen with an affinity of at least 1 x 10^7 M^-1, and (2) to
preferentially bind to the predetermined antigen with an
affinity that is at least two-fold greater than its affinity
for binding to a non-specific antigen (e.g., BSA, casein)
other than the predetermined antigen.
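
The two-part test in the foregoing definition can be sketched as a simple predicate. This is an illustrative sketch only: the function name, the parameter names, and the assumption that both affinities are supplied as association constants in M^-1 are not part of the specification.

```python
def is_specific_binding(ka_target, ka_nonspecific,
                        min_affinity=1e7, min_fold=2.0):
    """Apply the two criteria of the "specific binding" definition.

    ka_target      -- affinity (M^-1) for the predetermined antigen
    ka_nonspecific -- affinity (M^-1) for a non-specific antigen (e.g., BSA)
    Both names are hypothetical; the thresholds come from the definition.
    """
    # Criterion (1): affinity of at least 1 x 10^7 M^-1.
    meets_threshold = ka_target >= min_affinity
    # Criterion (2): at least two-fold greater than the non-specific affinity.
    is_preferential = ka_target >= min_fold * ka_nonspecific
    return meets_threshold and is_preferential
```

Under these assumptions, an antibody with affinities of 5e8 for the target and 1e5 for BSA satisfies both criteria, whereas one with 2e7 and 1.5e7 fails the two-fold requirement.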

The term "naturally-occurring" as used herein as 
applied to an object refers to the fact that an object can be 
found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses)
that can be isolated from a source in nature and which has not
been intentionally modified by man in the laboratory is
naturally-occurring.

The term "rearranged" as used herein refers to a
configuration of a heavy chain or light chain immunoglobulin
locus wherein a V segment is positioned immediately adjacent
to a D-J or J segment in a conformation encoding essentially a
complete V_H or V_L domain, respectively. A rearranged
immunoglobulin gene locus can be identified by comparison to
germline DNA; a rearranged locus will have at least one
recombined heptamer/nonamer homology element.

The term "unrearranged" or "germline configuration" 
as used herein in reference to a V segment refers to the 
configuration wherein the V segment is not recombined so as to 
be immediately adjacent to a D or J segment. 



Transgenic Nonhuman Animals Capable
of Producing Heterologous Antibodies

The design of a transgenic non-human animal that
responds to foreign antigen stimulation with a heterologous
antibody repertoire requires that the heterologous
immunoglobulin transgenes contained within the transgenic
animal function correctly throughout the pathway of B-cell
development. In a preferred embodiment, correct function of a
heterologous heavy chain transgene includes isotype switching.
Accordingly, the transgenes of the invention are constructed
so as to produce isotype switching and one or more of the
following: (1) high level and cell-type specific expression,
(2) functional gene rearrangement, (3) activation of and
response to allelic exclusion, (4) expression of a sufficient
primary repertoire, (5) signal transduction, (6) somatic
hypermutation, and (7) domination of the transgene antibody
locus during the immune response.

As will be apparent from the following disclosure,
not all of the foregoing criteria need be met. For example, in
those embodiments wherein the endogenous immunoglobulin loci
of the transgenic animal are functionally disrupted, the
transgene need not activate allelic exclusion. Further, in
those embodiments wherein the transgene comprises a
functionally rearranged heavy and/or light chain
immunoglobulin gene, the second criterion of functional gene
rearrangement is unnecessary, at least for that transgene
which is already rearranged. For background on molecular
immunology, see Fundamental Immunology, 2nd edition (1989),
Paul, William E., ed., Raven Press, N.Y., which is incorporated
herein by reference.

In one aspect of the invention, transgenic non-human
animals are provided that contain rearranged, unrearranged or
a combination of rearranged and unrearranged heterologous
immunoglobulin heavy and light chain transgenes in the
germline of the transgenic animal. Each of the heavy chain
transgenes comprises at least one C_H gene. In addition, the
heavy chain transgene may contain functional isotype switch
sequences, which are capable of supporting isotype switching
of a heterologous transgene encoding multiple C_H genes in
B-cells of the transgenic animal. Such switch sequences may be
those which occur naturally in the germline immunoglobulin
locus from the species that serves as the source of the
transgene C_H genes, or such switch sequences may be derived
from those which occur in the species that is to receive the
transgene construct (the transgenic animal). For example, a
human transgene construct that is used to produce a transgenic
mouse may produce a higher frequency of isotype switching
events if it incorporates switch sequences similar to those
that occur naturally in the mouse heavy chain locus, as
presumably the mouse switch sequences are optimized to
function with the mouse switch recombinase enzyme system,
whereas the human switch sequences are not. Switch sequences
may be isolated and cloned by conventional cloning methods,
or may be synthesized de novo from overlapping synthetic
oligonucleotides designed on the basis of published sequence
information relating to immunoglobulin switch region sequences
(Mills et al., Nucl. Acids Res. 18:7305-7316 (1991);
Sideras et al., Intl. Immunol. 1:631-642 (1989), which are
incorporated herein by reference).

For each of the foregoing transgenic animals,
functionally rearranged heterologous heavy and light chain
immunoglobulin transgenes are found in a significant fraction
of the B-cells of the transgenic animal (at least 10 percent).

The transgenes of the invention include a heavy
chain transgene comprising DNA encoding at least one variable
gene segment, one diversity gene segment, one joining gene
segment and at least one constant region gene segment. The
immunoglobulin light chain transgene comprises DNA encoding at
least one variable gene segment, one joining gene segment and
at least one constant region gene segment. The light and heavy
chain gene segments are heterologous to the transgenic
non-human animal in that they are derived from, or correspond
to, DNA encoding immunoglobulin heavy and light chain gene
segments from a species not consisting of the transgenic
non-human animal. In one aspect of the invention, the
transgene is constructed such that the individual gene
segments are unrearranged, i.e., not rearranged so as to
encode a functional immunoglobulin light or heavy chain. Such
unrearranged transgenes support recombination of the V, D, and
J gene segments (functional rearrangement) and preferably
support incorporation of all or a portion of a D region gene
segment in the resultant rearranged immunoglobulin heavy chain
within the transgenic non-human animal when exposed to
antigen.

In an alternate embodiment, the transgenes comprise
an unrearranged "mini-locus". Such transgenes typically
comprise a substantial portion of the C, D, and J segments as
well as a subset of the V gene segments. In such transgene
constructs, the various regulatory sequences, e.g. promoters,
enhancers, class switch regions, splice-donor and
splice-acceptor sequences for RNA processing, recombination
signals and the like, comprise corresponding sequences derived
from the heterologous DNA. Such regulatory sequences may be
incorporated into the transgene from the same or a related
species of the non-human animal used in the invention. For
example, human immunoglobulin gene segments may be combined in
a transgene with a rodent immunoglobulin enhancer sequence for
use in a transgenic mouse. Alternatively, synthetic regulatory
sequences may be incorporated into the transgene, wherein such
synthetic regulatory sequences are not homologous to a
functional DNA sequence that is known to occur naturally in
the genomes of mammals. Synthetic regulatory sequences are
designed according to consensus rules, such as, for example,
those specifying the permissible sequences of a
splice-acceptor site or a promoter/enhancer motif.

The invention also includes transgenic animals
containing germ line cells having a heavy and light transgene
wherein one of the said transgenes contains rearranged gene
segments with the other containing unrearranged gene segments.
In the preferred embodiments, the rearranged transgene is a
light chain immunoglobulin transgene and the unrearranged
transgene is a heavy chain immunoglobulin transgene.



The Structure and Generation of Antibodies

The basic structure of all immunoglobulins is based
upon a unit consisting of two light polypeptide chains and two
heavy polypeptide chains. Each light chain comprises two
regions known as the variable light chain region and the
constant light chain region. Similarly, the immunoglobulin
heavy chain comprises two regions designated the variable
heavy chain region and the constant heavy chain region.

The constant region for the heavy or light chain is
encoded by genomic sequences referred to as heavy or light
constant region gene (C_H) segments. The use of a particular
heavy chain gene segment defines the class of immunoglobulin.
For example, in humans, the μ constant region gene segments
define the IgM class of antibody whereas the use of a γ1, γ2,
γ3 or γ4 constant region gene segment defines the IgG class of
antibodies as well as the IgG subclasses IgG1 through IgG4.
Similarly, the use of an α1 or α2 constant region gene segment
defines the IgA class of antibodies as well as the subclasses
IgA1 and IgA2. The δ and ε constant region gene segments
define the IgD and IgE antibody classes, respectively.

The variable regions of the heavy and light
immunoglobulin chains together contain the antigen binding
domain of the antibody. Because of the need for diversity in
this region of the antibody to permit binding to a wide range
of antigens, the DNA encoding the initial or primary
repertoire variable region comprises a number of different DNA
segments derived from families of specific variable region
gene segments. In the case of the light chain variable
region, such families comprise variable (V) gene segments and
joining (J) gene segments. Thus, the initial variable region
of the light chain is encoded by one V gene segment and one J
gene segment each selected from the family of V and J gene
segments contained in the genomic DNA of the organism. In the
case of the heavy chain variable region, the DNA encoding the
initial or primary repertoire variable region of the heavy
chain comprises one heavy chain V gene segment, one heavy
chain diversity (D) gene segment and one J gene segment, each
selected from the appropriate V, D and J families of
immunoglobulin gene segments in genomic DNA.

In order to increase the diversity of sequences that
contribute to forming antibody binding sites, it is preferable
that a heavy chain transgene include cis-acting sequences that
support functional V-D-J rearrangement that can incorporate
all or part of a D region gene sequence in a rearranged V-D-J
gene sequence. Typically, at least about 1 percent of
expressed transgene-encoded heavy chains (or mRNAs) include
recognizable D region sequences in the V region. Preferably,
at least about 10 percent of transgene-encoded V regions
include recognizable D region sequences, more preferably at
least about 30 percent, and most preferably more than 50
percent include recognizable D region sequences.

A recognizable D region sequence is generally at
least about eight consecutive nucleotides corresponding to a
sequence present in a D region gene segment of a heavy chain
transgene and/or the amino acid sequence encoded by such D
region nucleotide sequence. For example, if a transgene
includes the D region gene DHQ52, a transgene-encoded mRNA
containing the sequence 5'-TAACTGGG-3' located in the V region
between a V gene segment sequence and a J gene segment
sequence is recognizable as containing a D region sequence,
specifically a DHQ52 sequence. Similarly, for example, if a
transgene includes the D region gene DHQ52, a
transgene-encoded heavy chain polypeptide containing the amino
acid sequence -DAF- located in the V region between a V gene
segment amino acid sequence and a J gene segment amino acid
sequence is recognizable as containing a D region sequence,
specifically a DHQ52 sequence.

However, because of somatic mutation and N-region
addition, some D region sequences may be recognizable but may
not correspond identically to a consecutive D region sequence
in the transgene. For example, a nucleotide sequence
5'-CTAAXTGGGG-3', where X is A, T, or G, and which is located in
a heavy chain V region and flanked by a V region gene sequence
and a J region gene sequence, can be recognized as
corresponding to the DHQ52 sequence 5'-CTAACTGGG-3'.
Similarly, for example, the polypeptide sequences -DAFDI-,
-DYFDY-, or -GAFDI- located in a V region and flanked on the
amino-terminal side by an amino acid sequence encoded by a
transgene V gene sequence and flanked on the carboxy-terminal
side by an amino acid sequence encoded by a transgene J gene
sequence are recognizable as D region sequences.
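
The "eight consecutive nucleotides" guideline given above can be sketched as a search for the longest run shared between a candidate V region and a D gene segment. This is a minimal illustration, not a prescribed method: the function names are hypothetical, and the D segment string used in the usage note is only the DHQ52 fragment quoted in the text.

```python
def longest_shared_run(region, d_segment):
    """Length of the longest contiguous substring common to both sequences."""
    best = 0
    prev = [0] * (len(d_segment) + 1)
    for a in region:
        cur = [0] * (len(d_segment) + 1)
        for j, b in enumerate(d_segment, start=1):
            if a == b:
                # Extend the run that ended at the previous position of each.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def has_recognizable_d(region, d_segment, min_run=8):
    """True if the region shares at least min_run consecutive nucleotides
    with the D segment (exact-match criterion only)."""
    return longest_shared_run(region.upper(), d_segment.upper()) >= min_run
```

With the fragment CTAACTGGG as the D segment, a region containing TAACTGGG passes this test, whereas the mutated sequence CTAAATGGGG shares a run of only four nucleotides and fails it. The exact-run test therefore misses somatically mutated D regions, which is why a broader definition based on flanking sequences and sequence similarity is also needed.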

Therefore, because somatic mutation and N-region
addition can produce mutations in sequences derived from a
transgene D region, the following definition is provided as a
guide for determining the presence of a recognizable D region
sequence. An amino acid sequence or nucleotide sequence is
recognizable as a D region sequence if: (1) the sequence is
located in a V region and is flanked on one side by a V gene
sequence (nucleotide sequence or deduced amino acid sequence)
and on the other side by a J gene sequence (nucleotide
sequence or deduced amino acid sequence) and (2) the sequence
is substantially identical or substantially similar to a known
D gene sequence (nucleotide sequence or encoded amino acid
sequence).

The term "substantial identity" as used herein
denotes a characteristic of a polypeptide sequence or nucleic
acid sequence, wherein the polypeptide sequence has at least
50 percent sequence identity compared to a reference sequence,
and the nucleic acid sequence has at least 70 percent sequence
identity compared to a reference sequence. The percentage of
sequence identity is calculated excluding small deletions or
additions which total less than 35 percent of the reference
sequence. The reference sequence may be a subset of a larger
sequence, such as an entire D gene; however, the reference
sequence is at least 8 nucleotides long in the case of
polynucleotides, and at least 3 amino acid residues long in
the case of a polypeptide. Typically, the reference sequence
is at least 8 to 12 nucleotides or at least 3 to 4 amino acids,
and preferably the reference sequence is 12 to 15 nucleotides
or more, or at least 5 amino acids.
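
As a rough illustration of the thresholds in this definition, percent identity can be computed position by position. This sketch assumes the two sequences have already been aligned to equal length and so does not model the exclusion of small deletions or additions; the function names and the "kind" argument are hypothetical.

```python
def percent_identity(query, reference):
    """Percent of positions at which two pre-aligned sequences agree."""
    if len(query) != len(reference):
        raise ValueError("sequences must be pre-aligned to equal length")
    matches = sum(q == r for q, r in zip(query, reference))
    return 100.0 * matches / len(reference)


# (minimum reference length, minimum percent identity) from the definition.
_IDENTITY_RULES = {"nucleotide": (8, 70.0), "polypeptide": (3, 50.0)}


def is_substantially_identical(query, reference, kind):
    """Apply the length floor and identity threshold for the sequence kind."""
    min_len, threshold = _IDENTITY_RULES[kind]
    if len(reference) < min_len:
        return False  # reference too short to qualify under the definition
    return percent_identity(query, reference) >= threshold
```

Under these assumptions, the polypeptide -DYFDY- compared with -DAFDI- scores 60 percent identity (three of five positions) and so meets the 50 percent polypeptide threshold.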

The term "substantial similarity" denotes a
characteristic of a polypeptide sequence, wherein the
polypeptide sequence has at least 80 percent similarity to a
reference sequence. The percentage of sequence similarity is
calculated by scoring identical amino acids or positional
conservative amino acid substitutions as similar. A
positional conservative amino acid substitution is one that
can result from a single nucleotide substitution; a first
amino acid is replaced by a second amino acid where a codon
for the first amino acid and a codon for the second amino acid
can differ by a single nucleotide substitution. Thus, for
example, the sequence -Lys-Glu-Arg-Val- is substantially
similar to the sequence -Asn-Asp-Ser-Val-, since the codon
sequence -AAA-GAA-AGA-GUU- can be mutated to -AAC-GAC-AGC-GUU-
by introducing only 3 substitution mutations, single
nucleotide substitutions in three of the four original codons.
The reference sequence may be a subset of a larger sequence,
such as an entire D gene; however, the reference sequence is
at least 4 amino acid residues long. Typically, the reference
sequence is at least 5 amino acids, and preferably the
reference sequence is 6 amino acids or more.
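
The notion of a positional conservative substitution can be illustrated with the standard genetic code: two amino acids qualify if any codon for one differs from any codon for the other at exactly one nucleotide. The sketch below uses one-letter amino acid codes and assumes sequences of equal length; the function names are hypothetical and the use of the standard code is an assumption of this example.

```python
from itertools import product

# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons_for(aa):
    """All codons encoding the given one-letter amino acid."""
    return [codon for codon, a in CODON_TABLE.items() if a == aa]


def is_positional_conservative(aa1, aa2):
    """True if some codon pair for aa1/aa2 differs at exactly one position."""
    return any(sum(x != y for x, y in zip(c1, c2)) == 1
               for c1 in codons_for(aa1) for c2 in codons_for(aa2))


def percent_similarity(query, reference):
    """Score identities and positional conservative substitutions as similar."""
    if len(query) != len(reference):
        raise ValueError("sequences must be of equal length")
    similar = sum(q == r or is_positional_conservative(q, r)
                  for q, r in zip(query, reference))
    return 100.0 * similar / len(reference)
```

For the example in the text, Lys-Glu-Arg-Val (KERV) against Asn-Asp-Ser-Val (NDSV) scores every position as similar, which exceeds the 80 percent threshold.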


The Primary Repertoire 

The process for generating DNA encoding the heavy
and light chain immunoglobulin genes occurs primarily in
developing B-cells. Prior to the joining of various
immunoglobulin gene segments, the V, D, J and constant (C)
gene segments are found, for the most part, in clusters of V,
D, J and C gene segments in the precursors of primary
repertoire B-cells. Generally, all of the gene segments for a
heavy or light chain are located in relatively close proximity
on a single chromosome. Such genomic DNA prior to
recombination of the various immunoglobulin gene segments is
referred to herein as "unrearranged" genomic DNA. During
B-cell differentiation, one of each of the appropriate family
members of the V, D, J (or only V and J in the case of light
chain genes) gene segments are recombined to form functionally
rearranged heavy and light immunoglobulin genes. Such
functional rearrangement is of the variable region segments to
form DNA encoding a functional variable region. This gene
segment rearrangement process appears to be sequential.
First, heavy chain D-to-J joints are made, followed by heavy
chain V-to-DJ joints and light chain V-to-J joints. The DNA
encoding this initial form of a functional variable region in
a light and/or heavy chain is referred to as "functionally
rearranged DNA" or "rearranged DNA". In the case of the heavy
chain, such DNA is referred to as "rearranged heavy chain DNA"
and in the case of the light chain, such DNA is referred to as
"rearranged light chain DNA". Similar language is used to
describe the functional rearrangement of the transgenes of the
invention.

The recombination of variable region gene segments
to form functional heavy and light chain variable regions is
mediated by recombination signal sequences (RSS's) that flank
recombinationally competent V, D and J segments. RSS's
necessary and sufficient to direct recombination comprise a
dyad-symmetric heptamer, an AT-rich nonamer and an intervening
spacer region of either 12 or 23 base pairs. These signals
are conserved among the different loci and species that carry
out D-J (or V-J) recombination and are functionally
interchangeable. See Oettinger, et al. (1990), Science, 248,
1517-1523 and references cited therein. The heptamer
comprises the sequence CACAGTG or its analogue followed by a
spacer of unconserved sequence and then a nonamer having the
sequence ACAAAAACC or its analogue. These sequences are found
on the J, or downstream side, of each V and D gene segment.
Immediately preceding the germline D and J segments are again
two recombination signal sequences, first the nonamer and then
the heptamer, again separated by an unconserved sequence. The
heptameric and nonameric sequences following a V_L, V_H or D
segment are complementary to those preceding the J_L, D or J_H
segments with which they recombine. The spacers between the
heptameric and nonameric sequences are either 12 base pairs
long or between 22 and 24 base pairs long.
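
The heptamer-spacer-nonamer arrangement described above can be sketched as a pattern search. This illustration makes several simplifying assumptions: it matches only the exact consensus heptamer and nonamer (not their analogues), it scans only the heptamer-first orientation found downstream of V and D segments, and the function name is hypothetical.

```python
import re

HEPTAMER = "CACAGTG"
NONAMER = "ACAAAAACC"

# Lookahead so that every start position is examined; the spacer is either
# 12 base pairs or 22 to 24 base pairs, as stated in the text.
RSS_PATTERN = re.compile(
    f"(?=({HEPTAMER})([ACGT]{{12}}|[ACGT]{{22,24}})({NONAMER}))"
)


def find_rss(seq):
    """Return (start position, spacer length) for each consensus RSS found."""
    return [(m.start(), len(m.group(2)))
            for m in RSS_PATTERN.finditer(seq.upper())]
```

A sequence carrying the heptamer, a 12 base pair spacer and the nonamer yields one hit with spacer length 12; a spacer of 15 base pairs yields no hit. Scanning upstream of D and J segments would require searching for the nonamer-first arrangement instead.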

In addition to the rearrangement of V, D and J
segments, further diversity is generated in the primary
repertoire of immunoglobulin heavy and light chain by way of
variable recombination between the V and J segments in the
light chain and between the D and J segments of the heavy
chain. Such variable recombination is generated by variation
in the exact place at which such segments are joined. Such
variation in the light chain typically occurs within the last
codon of the V gene segment and the first codon of the J
segment. Similar imprecision in joining occurs on the heavy
chain chromosome between the D and J_H segments and may extend
over as many as 10 nucleotides. Furthermore, several
nucleotides may be inserted between the D and J_H and between
the V_H and D gene segments which are not encoded by genomic
DNA. The addition of these nucleotides is known as N-region
diversity.

After VJ and/or VDJ rearrangement, transcription of
the rearranged variable region and one or more constant region
gene segments located downstream from the rearranged variable
region produces a primary RNA transcript which upon
appropriate RNA splicing results in an mRNA which encodes a
full length heavy or light immunoglobulin chain. Such heavy
and light chains include a leader signal sequence to effect
secretion through and/or insertion of the immunoglobulin into
the transmembrane region of the B-cell. The DNA encoding this
signal sequence is contained within the first exon of the V
segment used to form the variable region of the heavy or light
immunoglobulin chain. Appropriate regulatory sequences are
also present in the mRNA to control translation of the mRNA to
produce the encoded heavy and light immunoglobulin
polypeptides which upon proper association with each other
form an antibody molecule.

The net effect of such rearrangements in the
variable region gene segments and the variable recombination
which may occur during such joining is the production of a
primary antibody repertoire. Generally, each B-cell which has
differentiated to this stage produces a single primary
repertoire antibody. During this differentiation process,
cellular events occur which suppress the functional
rearrangement of gene segments other than those contained
within the functionally rearranged Ig gene. The process by
which diploid B-cells maintain such mono-specificity is termed
allelic exclusion.

The Secondary Repertoire

B-cell clones expressing immunoglobulins from within
the set of sequences comprising the primary repertoire are
immediately available to respond to foreign antigens. Because
of the limited diversity generated by simple VJ and VDJ
joining, the antibodies produced by the so-called primary
response are of relatively low affinity. Two different types
of B-cells make up this initial response: precursors of
primary antibody-forming cells and precursors of secondary
repertoire B-cells (Linton et al., Cell 59:1049-1059 (1989)).
The first type of B-cell matures into IgM-secreting plasma
cells in response to certain antigens. The other B-cells
respond to initial exposure to antigen by entering a T-cell
dependent maturation pathway.

During the T-cell dependent maturation of antigen 
stimulated B-cell clones, the structure of the antibody 
molecule on the cell surface changes in two ways: the constant
region switches to a non-IgM subtype and the sequence of the 
variable region can be modified by multiple single amino acid 
substitutions to produce a higher affinity antibody molecule. 

As previously indicated, each variable region of a
heavy or light Ig chain contains an antigen binding domain.
It has been determined by amino acid and nucleic acid
sequencing that somatic mutation during the secondary response
occurs throughout the V region including the three
complementarity determining regions (CDR1, CDR2 and CDR3), also
referred to as hypervariable regions 1, 2 and 3 (Kabat et al.,
Sequences of Proteins of Immunological Interest (1991) U.S.
Department of Health and Human Services, Washington, DC,
incorporated herein by reference). The CDR1 and CDR2 are
located within the variable gene segment whereas the CDR3 is
largely the result of recombination between V and J gene
segments or V, D and J gene segments. Those portions of the
variable region which do not consist of CDR1, 2 or 3 are
commonly referred to as framework regions designated FR1, FR2,
FR3 and FR4. See Fig. 1. During hypermutation, the
rearranged DNA is mutated to give rise to new clones with
altered Ig molecules. Those clones with higher affinities for
the foreign antigen are selectively expanded by helper
T-cells, giving rise to affinity maturation of the expressed
antibody. Clonal selection typically results in expression of
clones containing new mutations within the CDR1, 2 and/or 3
regions. However, mutations outside these regions also occur
which influence the specificity and affinity of the antigen
binding domain.



Transgenic Non-Human Animals Capable
of Producing Heterologous Antibody

Transgenic non-human animals in one aspect of the
invention are produced by introducing at least one of the
immunoglobulin transgenes of the invention (discussed
hereinafter) into a zygote or early embryo of a non-human
animal. The non-human animals which are used in the invention
generally comprise any mammal which is capable of rearranging
immunoglobulin gene segments to produce a primary antibody
response. Such nonhuman transgenic animals may include, for
example, transgenic pigs, transgenic rats, transgenic rabbits,
transgenic cattle, and other transgenic animal species,
particularly mammalian species, known in the art. A
particularly preferred non-human animal is the mouse or other
members of the rodent family.

However, the invention is not limited to the use of
mice. Rather, any non-human mammal which is capable of
mounting a primary and secondary antibody response may be
used. Such animals include non-human primates, such as
chimpanzee, bovine, ovine, and porcine species, other members
of the rodent family, e.g. rat, as well as rabbit and guinea
pig. Particularly preferred animals are mouse, rat, rabbit and
guinea pig, most preferably mouse.

In one embodiment of the invention, various gene
segments from the human genome are used in heavy and light
chain transgenes in an unrearranged form. In this embodiment,
such transgenes are introduced into mice. The unrearranged
gene segments of the light and/or heavy chain transgene have
DNA sequences unique to the human species which are
distinguishable from the endogenous immunoglobulin gene
segments in the mouse genome. They may be readily detected in
unrearranged form in the germ line and somatic cells not
consisting of B-cells and in rearranged form in B-cells.

In an alternate embodiment of the invention, the
transgenes comprise rearranged heavy and/or light
immunoglobulin transgenes. Specific segments of such
transgenes corresponding to functionally rearranged VDJ or VJ
segments contain immunoglobulin DNA sequences which are also
clearly distinguishable from the endogenous immunoglobulin
gene segments in the mouse.

Such differences in DNA sequence are also reflected 
in the amino acid sequence encoded by such human 
immunoglobulin transgenes as compared to those encoded by 
mouse B-cells. Thus, human immunoglobulin amino acid
sequences may be detected in the transgenic non-human animals 
of the invention with antibodies specific for immunoglobulin 
epitopes encoded by human immunoglobulin gene segments. 

Transgenic B-cells containing unrearranged 
transgenes from human or other species functionally recombine 
the appropriate gene segments to form functionally rearranged 
light and heavy chain variable regions. It will be readily 
apparent that the antibody encoded by such rearranged 
transgenes has a DNA and/or amino acid sequence which is 
heterologous to that normally encountered in the nonhuman 
animal used to practice the invention. 



Unrearranged Transgenes

As used herein, an "unrearranged immunoglobulin
heavy chain transgene" comprises DNA encoding at least one
variable gene segment, one diversity gene segment, one joining
gene segment and one constant region gene segment. Each of
the gene segments of said heavy chain transgene is derived
from, or has a sequence corresponding to, DNA encoding
immunoglobulin heavy chain gene segments from a species not 
consisting of the non-human animal into which said transgene 
is introduced. Similarly, as used herein, an "unrearranged 
immunoglobulin light chain transgene" comprises DNA encoding 
at least one variable gene segment, one joining gene segment 
and at least one constant region gene segment wherein each 
gene segment of said light chain transgene is derived from, or 
has a sequence corresponding to, DNA encoding immunoglobulin 
light chain gene segments from a species not consisting of the 
non-human animal into which said light chain transgene is 
introduced. 

Such heavy and light chain transgenes in this aspect
of the invention contain the above-identified gene segments in
an unrearranged form. Thus, interposed between the V, D and J
segments in the heavy chain transgene and between the V and J
segments on the light chain transgene are appropriate
recombination signal sequences (RSS's). In addition, such
transgenes also include appropriate RNA splicing signals to
join a constant region gene segment with the VJ or VDJ
rearranged variable region.

In order to facilitate isotype switching within a
heavy chain transgene containing more than one C region gene
segment, e.g. Cμ and Cγ1 from the human genome, as explained
below, "switch regions" are incorporated upstream from each of
the constant region gene segments and downstream from the
variable region gene segments to permit recombination between
such constant regions to allow for immunoglobulin class
switching, e.g. from IgM to IgG. Such heavy and light
immunoglobulin transgenes also contain transcription control
sequences including promoter regions situated upstream from
the variable region gene segments which typically contain TATA
motifs. A promoter region can be defined approximately as a
DNA sequence that, when operably linked to a downstream
sequence, can produce transcription of the downstream
sequence. Promoters may require the presence of additional
linked cis-acting sequences in order to produce efficient
transcription. In addition, other sequences that participate
in the transcription of sterile transcripts are preferably
included. Examples of sequences that participate in
expression of sterile transcripts can be found in the
published literature, including Rothman et al., Intl. Immunol.
2:621-627 (1990); Reid et al., Proc. Natl. Acad. Sci. USA
86:840-844 (1989); Stavnezer et al., Proc. Natl. Acad. Sci.
USA 85:7704-7708 (1988); and Mills et al., Nucl. Acids Res.
18:7305-7316 (1991), each of which is incorporated herein by
reference. These sequences typically include at least about
50 bp immediately upstream of a switch region, preferably
at least about 200 bp upstream of a switch region; and more
preferably at least about 200-1000 bp or more upstream of a
switch region. Suitable sequences occur immediately upstream
of the human Sγ1, Sγ2, Sγ3, Sγ4, Sα1, Sα2, and Sε switch
regions, although the sequences immediately upstream of the
human Sγ1 and Sγ3 switch regions are preferable. In
particular, interferon (IFN) inducible transcriptional
regulatory elements, such as IFN-inducible enhancers, are
preferably included immediately upstream of transgene switch
sequences.

In addition to promoters, other regulatory sequences
which function primarily in B-lineage cells are used. Thus,
for example, a light chain enhancer sequence situated
preferably between the J and constant region gene segments on
the light chain transgene is used to enhance transgene
expression, thereby facilitating allelic exclusion. In the
case of the heavy chain transgene, regulatory enhancers are
also employed. Such regulatory sequences are used to maximize
the transcription and translation of the transgene so as to
induce allelic exclusion and to provide relatively high levels
of transgene expression.

Although the foregoing promoter and enhancer
regulatory control sequences have been generically described,
such regulatory sequences may be heterologous to the nonhuman
animal, being derived from the genomic DNA from which the
heterologous transgene immunoglobulin gene segments are
obtained. Alternately, such regulatory gene segments are
derived from the corresponding regulatory sequences in the
genome of the non-human animal, or closely related species,
which contains the heavy and light transgene.

In the preferred embodiments, gene segments are
derived from human beings. The transgenic non-human animals
harboring such heavy and light transgenes are capable of
mounting an Ig-mediated immune response to a specific antigen
administered to such an animal. B-cells are produced within
such an animal which are capable of producing heterologous
human antibody. After immortalization, and the selection for
an appropriate monoclonal antibody (Mab), e.g. a hybridoma, a
source of therapeutic human monoclonal antibody is provided.
Such human Mabs have significantly reduced immunogenicity when
therapeutically administered to humans.

Although the preferred embodiments disclose the
construction of heavy and light transgenes containing human
gene segments, the invention is not so limited. In this
regard, it is to be understood that the teachings described
herein may be readily adapted to utilize immunoglobulin gene
segments from a species other than human beings. For example,
in addition to the therapeutic treatment of humans with the
antibodies of the invention, therapeutic antibodies encoded by
appropriate gene segments may be utilized to generate
monoclonal antibodies for use in the veterinary sciences.



Rearranged Transgenes

In an alternative embodiment, transgenic nonhuman
animals contain at least one functionally rearranged
heterologous heavy chain immunoglobulin transgene in the
germline of the transgenic animal. Such animals contain
primary repertoire B-cells that express such rearranged heavy
transgenes. Such B-cells preferably are capable of undergoing
somatic mutation when contacted with an antigen to form a
heterologous antibody having high affinity and specificity for
the antigen. Said rearranged transgenes will contain at least
two C_H genes and the associated sequences required for isotype
switching.

The invention also includes transgenic animals
containing germ line cells having heavy and light transgenes
wherein one of the said transgenes contains rearranged gene
segments with the other containing unrearranged gene segments.
In such animals, the heavy chain transgenes shall have at
least two C_H genes and the associated sequences required for
isotype switching.

The invention further includes methods for
generating a synthetic variable region gene segment repertoire
to be used in the transgenes of the invention. The method
comprises generating a population of immunoglobulin V segment
DNAs wherein each of the V segment DNAs encodes an
immunoglobulin V segment and contains at each end a cleavage
recognition site of a restriction endonuclease. The
population of immunoglobulin V segment DNAs is thereafter
concatenated to form the synthetic immunoglobulin V segment
repertoire. Such synthetic variable region heavy chain
transgenes shall have at least two C_H genes and the associated
sequences required for isotype switching.
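
As a rough illustration of the concatenation step described above, the following Python sketch treats each V segment DNA as a string carrying a restriction endonuclease recognition site at each end and joins the population end to end. The enzyme site (XhoI, CTCGAG), the toy sequences, and the function names are assumptions made only for illustration and do not come from the specification. The string model is also a simplification: it assumes that cutting and ligation leave exactly one regenerated site between neighbouring segments.

```python
# Minimal string-level sketch of building a synthetic V segment repertoire.
# Assumption: XhoI (CTCGAG) is used purely as an example recognition site.
XHOI_SITE = "CTCGAG"


def flank_with_site(core: str, site: str = XHOI_SITE) -> str:
    """Return a V segment DNA carrying the recognition site at each end."""
    return site + core + site


def strip_sites(segment: str, site: str = XHOI_SITE) -> str:
    """Remove the flanking sites, checking that both are present."""
    has_both = segment.startswith(site) and segment.endswith(site)
    if not has_both or len(segment) < 2 * len(site):
        raise ValueError("segment is not flanked by the recognition site")
    return segment[len(site):len(segment) - len(site)]


def concatenate_segments(segments: list[str], site: str = XHOI_SITE) -> str:
    """Join flanked segments so one site separates each pair of neighbours."""
    cores = [strip_sites(segment, site) for segment in segments]
    return site.join(cores)


if __name__ == "__main__":
    population = [flank_with_site(core) for core in ("AAAA", "CCCC", "GGGG")]
    print(concatenate_segments(population))
```

The order of segments in the output simply follows the input list; random ligation in practice would not preserve any particular order or orientation.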

Isotype Switching

In the development of a B lymphocyte, the cell
initially produces IgM with a binding specificity determined
by the productively rearranged V_H and V_L regions.
Subsequently, each B cell and its progeny cells synthesize
antibodies with the same L and H chain V regions, but they may
switch the isotype of the H chain.

The use of μ or δ constant regions is largely
determined by alternate splicing, permitting IgM and IgD to be
coexpressed in a single cell. The other heavy chain isotypes
(γ, α, and ε) are only expressed natively after a gene
rearrangement event deletes the Cμ and Cδ exons. This gene
rearrangement process, termed isotype switching, typically
occurs by recombination between so-called switch segments
located immediately upstream of each heavy chain gene (except
δ). The individual switch segments are between 2 and 10 kb in
length, and consist primarily of short repeated sequences.
The exact point of recombination differs for individual class
switching events. Investigations which have used solution
hybridization kinetics or Southern blotting with cDNA-derived
C_H probes have confirmed that switching can be associated with
loss of C_H sequences from the cell.

The switch (S) region of the μ gene, Sμ, is located
about 1 to 2 kb 5' to the coding sequence and is composed of
numerous tandem repeats of sequences of the form
(GAGCT)n(GGGGT), where n is usually 2 to 5 but can range as
high as 17. (See T. Nikaido et al., Nature 292:845-848 (1981).)
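
The repeat form (GAGCT)n(GGGGT) can be located in a sequence with a short pattern search. The sketch below is only an illustration under stated assumptions: the function name and the toy sequence are hypothetical, matching is exact, and no claim is made that real Sμ sequence is this regular.

```python
import re

# One or more GAGCT pentamers followed directly by a single GGGGT pentamer.
SWITCH_REPEAT = re.compile(r"((?:GAGCT)+)GGGGT")


def find_switch_repeats(seq: str) -> list[tuple[int, int]]:
    """Return (start, n) for each (GAGCT)n(GGGGT) unit found in seq."""
    hits = []
    for match in SWITCH_REPEAT.finditer(seq.upper()):
        n = len(match.group(1)) // 5  # number of GAGCT copies in this unit
        hits.append((match.start(), n))
    return hits


if __name__ == "__main__":
    toy = "TT" + "GAGCT" * 3 + "GGGGT" + "GAGCT" * 2 + "GGGGT" + "AA"
    for start, n in find_switch_repeats(toy):
        print(f"unit at position {start} with n = {n}")
```

Counting the units and the distribution of n over a candidate region gives a quick, informal check of whether it resembles the tandem structure described for Sμ.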

Similar internally repetitive switch sequences
spanning several kilobases have been found 5' of the other C_H
genes. The Sα region has been sequenced and found to consist
of tandemly repeated 80-bp homology units, whereas Sγ2a, Sγ2b,
and Sγ3 all contain repeated 49-bp homology units very similar
to each other. (See P. Szurek et al., J. Immunol. 135:620-626
(1985) and T. Nikaido et al., J. Biol. Chem. 257:7322-7329
(1982), which are incorporated herein by reference.) All the
sequenced S regions include numerous occurrences of the
pentamers GAGCT and GGGGT that are the basic repeated elements
of the Sμ gene (T. Nikaido et al., J. Biol. Chem.
257:7322-7329 (1982), which is incorporated herein by
reference); in the other S regions these pentamers are not
precisely tandemly repeated as in Sμ, but instead are embedded
in larger repeat units. The Sγ1 region has an additional
higher-order structure: two direct repeat sequences flank each
of two clusters of 49-bp tandem repeats. (See M. R. Mowatt et
al., J. Immunol. 136:2674-2683 (1986), which is incorporated
herein by reference.)

Switch regions of human H chain genes have been
found to be very similar to their mouse homologs. Indeed,
similarity between pairs of human and mouse clones 5' to the
C_H genes has been found to be confined to the S regions, a fact
that confirms the biological significance of these regions.

A switch recombination between μ and α genes
produces a composite Sμ-Sα sequence. Typically, there is no
specific site, either in Sμ or in any other S region, where
the recombination always occurs.

Generally, unlike the enzymatic machinery of V-J
recombination, the switch machinery can apparently accommodate
different alignments of the repeated homologous regions of
germline S precursors and then join the sequences at different
positions within the alignment. (See T. H. Rabbitts et al.,
Nucleic Acids Res. 9:4509-4524 (1981) and J. Ravetch et al.,
Proc. Natl. Acad. Sci. USA 77:6734-6738 (1980), which are
incorporated herein by reference.)

The exact details of the mechanism(s) of selective
activation of switching to a particular isotype are unknown.
Although exogenous influences such as lymphokines and
cytokines might upregulate isotype-specific recombinases, it
is also possible that the same enzymatic machinery catalyzes
switches to all isotypes and that specificity lies in
targeting this machinery to specific switch regions.

The T-cell-derived lymphokines IL-4 and IFN-γ have
been shown to specifically promote the expression of certain
isotypes: IL-4 decreases IgM, IgG2a, IgG2b, and IgG3
expression and increases IgE and IgG1 expression; while IFN-γ
selectively stimulates IgG2a expression and antagonizes the
IL-4-induced increase in IgE and IgG1 expression (Coffman et
al., J. Immunol. 136:949-954 (1986) and Snapper et al.,
Science 236:944-947 (1987), which are incorporated herein by
reference). A combination of IL-4 and IL-5 promotes IgA
expression (Coffman et al., J. Immunol. 139:3685-3690 (1987),
which is incorporated herein by reference).

Most of the experiments implicating T-cell effects
on switching have not ruled out the possibility that the
observed increase in cells with particular switch
recombinations might reflect selection of preswitched or
precommitted cells; but the most likely explanation is that
the lymphokines actually promote switch recombination.

Induction of class switching appears to be
associated with sterile transcripts that initiate upstream of
the switch segments (Lutzker et al., Mol. Cell. Biol. 8:1849
(1988); Stavnezer et al., Proc. Natl. Acad. Sci. USA 85:7704
(1988); Esser and Radbruch, EMBO J. 8:483 (1989); Berton et
al., Proc. Natl. Acad. Sci. USA 86:2829 (1989); Rothman et
al., Int. Immunol. 2:621 (1990), each of which is incorporated
herein by reference). For example, the observed induction of the γ1
sterile transcript by IL-4 and inhibition by IFN-γ correlates
with the observation that IL-4 promotes class switching to γ1
in B-cells in culture, while IFN-γ inhibits γ1 expression.
Therefore, the inclusion of regulatory sequences that affect
the transcription of sterile transcripts may also affect the
rate of isotype switching. For example, increasing the
transcription of a particular sterile transcript typically can
be expected to enhance the frequency of isotype switch
recombination involving adjacent switch sequences.

For these reasons, it is preferable that transgenes
incorporate transcriptional regulatory sequences within about
1-2 kb upstream of each switch region that is to be utilized
for isotype switching. These transcriptional regulatory
sequences preferably include a promoter and an enhancer
element, and more preferably include the 5' flanking (i.e.,
upstream) region that is naturally associated (i.e., occurs in
germline configuration) with a switch region. This 5'
flanking region is typically at least about 50 nucleotides in
length, preferably at least about 200 nucleotides in length,
and more preferably at least 500-1000 nucleotides.

Although a 5' flanking sequence from one switch
region can be operably linked to a different switch region for
transgene construction (e.g., the 5' flanking sequence from
the human Sγ1 switch can be grafted immediately upstream of the
Sα1 switch), in some embodiments it is preferred that each
switch region incorporated in the transgene construct have the
5' flanking region that occurs immediately upstream in the
naturally occurring germline configuration.

The Transgenic Primary Repertoire
A. The Human Immunoglobulin Loci

An important requirement for transgene function is
the generation of a primary antibody repertoire that is
diverse enough to trigger a secondary immune response for a
wide range of antigens. The rearranged heavy chain gene
consists of a signal peptide exon, a variable region exon and
a tandem array of multi-domain constant region genes, each
of which is encoded by several exons. Each of the constant
region genes encodes the constant portion of a different class
of immunoglobulins. During B-cell development, V region
proximal constant regions are deleted leading to the
expression of new heavy chain classes. For each heavy chain
class, alternative patterns of RNA splicing give rise to both
transmembrane and secreted immunoglobulins.

The human heavy chain locus consists of 
approximately 200 V gene segments spanning 2 Mb, approximately 
30 D gene segments spanning about 40 kb, six J segments 
clustered within a 3 kb span, and nine constant region gene 
segments spread out over approximately 300 kb. The entire 
locus spans approximately 2.5 Mb of the distal portion of the 
long arm of chromosome 14.

B. Gene Fragment Transgenes
1. Heavy Chain Transgene

In a preferred embodiment, immunoglobulin heavy and
light chain transgenes comprise unrearranged genomic DNA from
humans. In the case of the heavy chain, a preferred transgene
comprises a NotI fragment having a length between 670 and 830
kb. The length of this fragment is ambiguous because the 3'
restriction site has not been accurately mapped. It is known,
however, to reside between the α1 and ψα gene segments. This
fragment contains members of all six of the known V_H families,
the D and J gene segments, as well as the μ, δ, γ3, γ1 and α1
constant regions (Berman et al., EMBO J. 7:727-738 (1988),
which is incorporated herein by reference). A transgenic
mouse line containing this transgene correctly expresses a
heavy chain class required for B-cell development (IgM) and at
least one switched heavy chain class (IgG1), in conjunction
with a sufficiently large repertoire of variable regions to
trigger a secondary response for most antigens.



2. Light Chain Transgene

A genomic fragment containing all of the necessary 
gene segments and regulatory sequences from a human light 
chain locus may be similarly constructed. Such transgenes are 
constructed as described in the Examples. 



C. Transgenes Generated Intracellularly by In Vivo Recombination

It is not necessary to isolate all or part of
the heavy chain locus on a single DNA fragment. Thus, for
example, the 670-830 kb NotI fragment from the human
immunoglobulin heavy chain locus may be formed in vivo in the
non-human animal during transgenesis. Such in vivo transgene
construction is produced by introducing two or more
overlapping DNA fragments into an embryonic nucleus of the
non-human animal. The overlapping portions of the DNA
fragments have DNA sequences which are substantially
homologous. Upon exposure to the recombinases contained
within the embryonic nucleus, the overlapping DNA fragments
are homologously recombined in proper orientation to form the
670-830 kb NotI heavy chain fragment.

In vivo transgene construction can be used to form
any number of immunoglobulin transgenes which because of their
size are otherwise difficult, or impossible, to make or
manipulate by present technology. Thus, in vivo transgene
construction is useful to generate immunoglobulin transgenes
which are larger than DNA fragments which may be manipulated
by YAC vectors (Murray and Szostak, Nature 305:189-193
(1983)). Such in vivo transgene construction may be used to
introduce into a non-human animal substantially the entire
immunoglobulin loci from a species not consisting of the
transgenic non-human animal.

In addition to forming genomic immunoglobulin
transgenes, in vivo homologous recombination may also be
utilized to form "mini-locus" transgenes as described in the
Examples.

In the preferred embodiments utilizing in vivo
transgene construction, each overlapping DNA fragment
preferably has an overlapping substantially homologous DNA
sequence between the end portion of one DNA fragment and the
end portion of a second DNA fragment. Such overlapping
portions of the DNA fragments preferably comprise about 500 bp
to about 2000 bp, most preferably 1.0 kb to 2.0 kb.
Homologous recombination of overlapping DNA fragments to form
transgenes in vivo is further described in commonly assigned
PCT Publication No. WO 92/03917 entitled "Homologous
Recombination in Mammalian Cells" published March 19, 1992.
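
The way two overlapping fragments define a single larger construct can be sketched at the sequence level. The following Python example is a simplified model, not a description of the cellular recombination mechanism: it assumes the overlap is an exact match (the text requires only substantial homology), and the function names and the default minimum overlap of 500 bp, taken from the lower bound quoted above, are illustrative choices.

```python
def overlap_length(left: str, right: str) -> int:
    """Length of the longest suffix of `left` that is a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return k
    return 0


def join_fragments(left: str, right: str, min_overlap: int = 500) -> str:
    """Merge two fragments through their shared end portion.

    Raises ValueError when the shared portion is shorter than min_overlap.
    """
    k = overlap_length(left, right)
    if k < min_overlap:
        raise ValueError(f"overlap of {k} bp is below the {min_overlap} bp minimum")
    return left + right[k:]


if __name__ == "__main__":
    # Toy fragments with a 6 bp overlap; real overlaps would be far longer.
    print(join_fragments("AAAACCCCGGGG", "CCGGGGTTTT", min_overlap=4))
```

Joining more than two fragments amounts to applying the same step repeatedly from one end of the intended construct to the other.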

D. Minilocus Transgenes

As used herein, the term "immunoglobulin minilocus"
refers to a DNA sequence (which may be within a longer
sequence), usually of less than about 150 kb, typically
between about 25 and 100 kb, containing at least one each of
the following: a functional variable (V) gene segment, a
functional joining (J) region segment, at least one functional
constant (C) region gene segment, and, if it is a heavy chain
minilocus, a functional diversity (D) region segment, such
that said DNA sequence contains at least one substantial
discontinuity (e.g., a deletion, usually of at least about 2
to 5 kb, preferably 10-25 kb or more, relative to the
homologous genomic DNA sequence). A light chain minilocus
transgene will be at least 25 kb in length, typically 50 to 60
kb. A heavy chain transgene will typically be about 70 to 80
kb in length, preferably at least about 60 kb with two
constant regions operably linked to switch regions.
Furthermore, the individual elements of the minilocus are
preferably in the germline configuration and capable of
undergoing gene rearrangement in the pre-B cell of a
transgenic animal so as to express functional antibody
molecules with diverse antigen specificities encoded entirely
by the elements of the minilocus. Further, a heavy chain
minilocus comprising at least two C_H genes and the requisite
switching sequences is typically capable of undergoing isotype
switching, so that functional antibody molecules of different
immunoglobulin classes will be generated. Such isotype
switching may occur in vivo in B-cells residing within the
transgenic nonhuman animal, or may occur in cultured cells of
the B-cell lineage which have been explanted from the
transgenic nonhuman animal.

In an alternate preferred embodiment, immunoglobulin
heavy chain transgenes comprise one or more of each of the V_H,
D, and J_H gene segments and two or more of the C_H genes. At
least one of each appropriate type gene segment is
incorporated into the minilocus transgene. With regard to the
C_H segments for the heavy chain transgene, it is preferred
that the transgene contain at least one μ gene segment and at
least one other constant region gene segment, more preferably
a γ gene segment, and most preferably γ3 or γ1. This
preference is to allow for class switching between IgM and IgG
forms of the encoded immunoglobulin and the production of a
secretable form of high affinity non-IgM immunoglobulin.
Other constant region gene segments may also be used such as
those which encode for the production of IgD, IgA and IgE.

Those skilled in the art will also construct
transgenes wherein the order of occurrence of heavy chain C_H
genes will be different from the naturally-occurring spatial
order found in the germline of the species serving as the
donor of the C_H genes.

Additionally, those skilled in the art can select C_H
genes from more than one individual of a species (e.g.,
allogeneic C_H genes) and incorporate said genes in the
transgene as supernumerary C_H genes capable of undergoing
isotype switching; the resultant transgenic nonhuman animal
may then, in some embodiments, make antibodies of various
classes including all of the allotypes represented in the
species from which the transgene C_H genes were obtained.

Still further, those skilled in the art can select
C_H genes from different species to incorporate into the
transgene. Functional switch sequences are included with each
C_H gene, although the switch sequences used are not
necessarily those which occur naturally adjacent to the C_H
gene. Interspecies C_H gene combinations will produce a
transgenic nonhuman animal which may produce antibodies of
various classes corresponding to C_H genes from various
species. Transgenic nonhuman animals containing interspecies
C_H transgenes may serve as the source of B-cells for
constructing hybridomas to produce monoclonals for veterinary
uses.

The heavy chain J region segments in the human
comprise six functional J segments and three pseudogenes
clustered in a 3 kb stretch of DNA. Given its relatively
compact size and the ability to isolate these segments
together with the μ gene and the 5' portion of the δ gene on a
single 23 kb SfiI/SpeI fragment (Sado et al., Biochem.
Biophys. Res. Comm. 154:264-271 (1988), which is incorporated
herein by reference), it is preferred that all of the J region
gene segments be used in the mini-locus construct. Since this
fragment spans the region between the μ and δ genes, it is
likely to contain all of the 3' cis-linked regulatory elements
required for μ expression. Furthermore, because this fragment
includes the entire J region, it contains the heavy chain
enhancer and the μ switch region (Mills et al., Nature 306:809
(1983); Yancopoulos and Alt, Ann. Rev. Immunol. 4:339-368
(1986), which are incorporated herein by reference). It also
contains the transcription start sites which trigger VDJ
joining to form primary repertoire B-cells (Yancopoulos and
Alt, Cell 40:271-281 (1985), which is incorporated herein by
reference). Alternatively, a 36 kb BssHII/SpeI fragment,
which includes part of the D region, may be used in place of
the 23 kb SfiI/SpeI fragment. The use of such a fragment
increases the amount of 5' flanking sequence to facilitate
efficient D-to-J joining.

The human D region consists of 4 or 5 homologous 9
kb subregions, linked in tandem (Siebenlist et al. (1981),
Nature, 294, 631-635). Each subregion contains up to 10
individual D segments. Some of these segments have been
mapped and are shown in Fig. 4. Two different strategies are
used to generate a mini-locus D region. The first strategy
involves using only those D segments located in a short
contiguous stretch of DNA that includes one or two of the
repeated D subregions. A candidate is a single 15 kb fragment
that contains 12 individual D segments. This piece of DNA
consists of 2 contiguous EcoRI fragments and has been
completely sequenced (Ichihara et al. (1988), EMBO J., 7,
4141-4150). Twelve D segments should be sufficient for a
primary repertoire. However, given the dispersed nature of
the D region, an alternative strategy is to ligate together
several non-contiguous D-segment containing fragments, to
produce a smaller piece of DNA with a greater number of
segments. Additional D-segment genes can be identified, for
example, by the presence of characteristic flanking nonamer
and heptamer sequences, supra, and by reference to the
literature.
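
Identification of candidate D segments by their flanking nonamer and heptamer sequences can be sketched as a pattern search. The example below is illustrative only and rests on assumptions that are not stated in the specification: it uses the commonly cited consensus heptamer CACAGTG and nonamer ACAAAAACC, assumes 12-bp spacers on both sides of the D segment, caps the coding length at 40 bp, and requires exact matches, whereas naturally occurring signal sequences often deviate from the consensus.

```python
import re

# Assumed consensus recombination signal elements (not from the specification).
HEPTAMER = "CACAGTG"
NONAMER = "ACAAAAACC"
SPACER = "[ACGT]{12}"  # assumed 12-bp spacer on each side of a D segment


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# 5' signal (nonamer-spacer-heptamer, reverse complemented), the D coding
# region, then the 3' signal (heptamer-spacer-nonamer).
D_PATTERN = re.compile(
    revcomp(NONAMER) + SPACER + revcomp(HEPTAMER)
    + "([ACGT]{1,40}?)"
    + HEPTAMER + SPACER + NONAMER
)


def find_d_segments(seq: str) -> list[tuple[int, str]]:
    """Return (start, coding_sequence) for each candidate D segment."""
    return [(m.start(1), m.group(1)) for m in D_PATTERN.finditer(seq.upper())]


if __name__ == "__main__":
    toy = ("TTT" + "GGTTTTTGT" + "A" * 12 + "CACTGTG"
           + "GGTATAACTGGAAC"
           + "CACAGTG" + "C" * 12 + "ACAAAAACC" + "GG")
    print(find_d_segments(toy))
```

A practical search would score partial matches to the consensus instead of requiring identity, and candidates would still need to be checked against the published D segment sequences.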

At least one, and preferably more than one, V gene
segment is used to construct the heavy chain minilocus
transgene. Rearranged or unrearranged V segments with or
without flanking sequences can be isolated as described in PCT
Publication No. WO 92/03918, published March 19, 1992,
entitled "Transgenic Non-Human Animals Capable of Producing
Heterologous Antibodies."

Rearranged or unrearranged V segments, D segments, J
segments, and C genes, with or without flanking sequences, can
be isolated as described in PCT Publication No. WO 92/03918,
published March 19, 1992.

A minilocus light chain transgene may be similarly
constructed from the human λ or κ immunoglobulin locus.
Thus, for example, an immunoglobulin heavy chain minilocus
transgene construct, e.g., of about 75 kb, encoding V, D, J
and constant region sequences can be formed from a plurality
of DNA fragments, with each sequence being substantially
homologous to human gene sequences. Preferably, the sequences
are operably linked to transcription regulatory sequences and
are capable of undergoing rearrangement. With two or more
appropriately placed constant region sequences (e.g., μ and γ)
and switch regions, switch recombination also occurs. An
exemplary light chain transgene construct can be formed
similarly from a plurality of DNA fragments, substantially
homologous to human DNA and capable of undergoing
rearrangement.

E. Transgene Constructs Capable of Isotype Switching

Ideally, transgene constructs that are intended to
undergo class switching should include all of the cis-acting
sequences necessary to regulate sterile transcripts.
Naturally occurring switch regions and upstream promoters and
regulatory sequences (e.g., IFN-inducible elements) are
preferred cis-acting sequences that are included in transgene
constructs capable of isotype switching. At least about 50
basepairs, preferably at least about 200 basepairs, and more
preferably at least 500 to 1000 basepairs or more of sequence
immediately upstream of a switch region, preferably a human γ1
switch region, should be operably linked to a switch sequence,
preferably a human γ1 switch sequence. Further, switch
regions can be linked upstream of (and adjacent to) C_H genes
that do not naturally occur next to the particular switch
region. For example, but not for limitation, a human γ1
switch region may be linked upstream from a human α2 C_H gene,
or a murine γ2 switch may be linked to a human C_H gene.

An alternative method for obtaining non-classical
isotype switching (e.g., δ-associated deletion) in transgenic
mice involves the inclusion of the 400 bp direct repeat
sequences (σμ and Σμ) that flank the human μ gene (Yasui et
al., Eur. J. Immunol. 19:1399 (1989)). Homologous
recombination between these two sequences deletes the μ gene
in IgD-only B-cells. Heavy chain transgenes can be
represented by the following formulaic description:

$(V_H)_x-(D)_y-(J_H)_z-(S_D)_m-(C_1)_n-[(T)-(S_A)_p-(C_2)]_q$

where:

V_H is a heavy chain variable region gene segment,
D is a heavy chain D (diversity) region gene segment,
J_H is a heavy chain J (joining) region gene segment,
S_D is a donor region segment capable of participating in
a recombination event with the S_A acceptor region
segments such that isotype switching occurs,
C_1 is a heavy chain constant region gene segment encoding
an isotype utilized in B cell development (e.g.,
μ or δ),
T is a cis-acting transcriptional regulatory region
segment containing at least a promoter,
S_A is an acceptor region segment capable of participating
in a recombination event with selected S_D donor
region segments, such that isotype switching occurs,
C_2 is a heavy chain constant region gene segment encoding
an isotype other than μ (e.g., γ1, γ2, γ3, γ4, α1,
α2, ε).
x, y, z, m, n, p, and q are integers: x is 1-100, y is
1-50, z is 1-50, m is 0-10, n is 0-10, p is 1-10, and q
is 0-50. Typically, when the transgene is capable
of isotype switching, q must be at least 1, m is at
least 1, n is at least 1, and m is greater than or
equal to n.

V_H, D, J_H, S_D, C_1, T, S_A, and C_2 segments may be
selected from various species, preferably mammalian species,
and more preferably from human and murine germline DNA.

V_H segments may be selected from various species,
but are preferably selected from V_H segments that occur
naturally in the human germline, such as V_H251. Typically
about 2 V_H gene segments are included, preferably about 4 V_H
segments are included, and most preferably at least about 10
V_H segments are included.
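
The integer ranges and the typical switching conditions stated for the formulaic description above can be captured in a small data structure. The Python sketch below is an illustration only; the class and function names are hypothetical, and the checks encode nothing beyond the ranges for x, y, z, m, n, p and q and the conditions given for a transgene capable of isotype switching.

```python
from dataclasses import dataclass

# Stated ranges for each count in the formulaic description.
RANGES = {
    "x": (1, 100),  # V_H segments
    "y": (1, 50),   # D segments
    "z": (1, 50),   # J_H segments
    "m": (0, 10),   # S_D donor regions
    "n": (0, 10),   # C_1 constant region genes
    "p": (1, 10),   # S_A acceptor regions per [T-S_A-C_2] unit
    "q": (0, 50),   # [T-S_A-C_2] units
}


@dataclass(frozen=True)
class HeavyChainTransgene:
    x: int
    y: int
    z: int
    m: int
    n: int
    p: int
    q: int


def in_stated_ranges(t: HeavyChainTransgene) -> bool:
    """True when every count lies within its stated range."""
    return all(lo <= getattr(t, name) <= hi for name, (lo, hi) in RANGES.items())


def switch_capable(t: HeavyChainTransgene) -> bool:
    """True when the typical conditions for isotype switching are met."""
    return in_stated_ranges(t) and t.q >= 1 and t.m >= 1 and t.n >= 1 and t.m >= t.n


def describe(t: HeavyChainTransgene) -> str:
    """Render the construct in the plain-text form of the formula."""
    return (f"(V_H){t.x}-(D){t.y}-(J_H){t.z}-(S_D){t.m}-(C_1){t.n}"
            f"-[(T)-(S_A){t.p}-(C_2)]{t.q}")


if __name__ == "__main__":
    example = HeavyChainTransgene(x=4, y=10, z=6, m=1, n=1, p=1, q=1)
    print(describe(example), switch_capable(example))
```

A construct with q equal to 0 still falls within the stated ranges but fails the switching check, which mirrors the distinction the text draws between transgenes in general and those capable of isotype switching.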

At least one D segment is typically included,
although at least 10 D segments are preferably included, and
some embodiments include more than ten D segments. Some
preferred embodiments include human D segments.

Typically at least one J_H segment is incorporated in
the transgene, although it is preferable to include about six
J_H segments, and some preferred embodiments include more than
about six J_H segments. Some preferred embodiments include
human J_H segments, and further preferred embodiments include
six human J_H segments and no nonhuman J_H segments.

S D segments are donor regions capable of
participating in recombination events with the S A segment of
the transgene. For classical isotype switching, S D and S A are
switch regions such as Sμ, Sγ1, Sγ2, Sγ3, Sγ4, Sα, Sα2, and Sε.
Preferably the switch regions are murine or human, more
preferably S D is a human or murine Sμ and S A is a human or
murine Sγ1. For nonclassical isotype switching (δ-associated
deletion), S D and S A are preferably the 400 basepair direct
repeat sequences that flank the human μ gene.

C 1 segments are typically μ or δ genes, preferably a
μ gene, and more preferably a human or murine μ gene.

T segments typically include 5' flanking sequences
that are adjacent to naturally occurring (i.e., germline)
switch regions. T segments are typically at least about
50 nucleotides in length, preferably at least about 200
nucleotides in length, and more preferably at least 500-1000
nucleotides in length. Preferably T segments are 5' flanking
sequences that occur immediately upstream of human or murine
switch regions in a germline configuration. It is also
evident to those of skill in the art that T segments may
comprise cis-acting transcriptional regulatory sequences that
do not occur naturally in an animal germline (e.g., viral
enhancers and promoters such as those found in SV40,
adenovirus, and other viruses that infect eukaryotic cells).

C 2 segments are typically a γ1, γ2, γ3, γ4, α1, α2,
or ε C H gene, preferably a human C H gene of these isotypes, and
more preferably a human γ1 or γ3 gene. Murine γ2a and γ2b may
also be used, as may downstream (i.e., switched) isotype genes
from various species. Where the heavy chain transgene
contains an immunoglobulin heavy chain minilocus, the total
length of the transgene will be typically 150 kilo basepairs
or less.

In general, the transgene will be other than a
native heavy chain Ig locus. Thus, for example, deletion of
unnecessary regions or substitutions with corresponding
regions from other species will be present.

F. Methods for Determining Functional
Isotype Switching in Ig Transgenes

The occurrence of isotype switching in a transgenic
nonhuman animal may be identified by any method known to those
in the art. Preferred embodiments include the following,
employed either singly or in combination:

1. detection of mRNA transcripts that contain a sequence
homologous to at least one transgene downstream C H gene other
than δ and an adjacent sequence homologous to a transgene
V H-D H-J H rearranged gene; such detection may be by Northern
hybridization, S1 nuclease protection assays, PCR
amplification, cDNA cloning, or other methods;

2. detection in the serum of the transgenic animal, or in
supernatants of cultures of hybridoma cells made from B-cells
of the transgenic animal, of immunoglobulin proteins encoded
by downstream C H genes, where such proteins can also be shown
by immunochemical methods to comprise a functional variable
region;

3. detection, in DNA from B-cells of the transgenic
animal or in genomic DNA from hybridoma cells, of DNA
rearrangements consistent with the occurrence of isotype
switching in the transgene; such detection may be accomplished
by Southern blot hybridization, PCR amplification, genomic
cloning, or other method; or

4. identification of other indicia of isotype switching,
such as production of sterile transcripts, production of
characteristic enzymes involved in switching (e.g., "switch
recombinase"), or other manifestations that may be detected,
measured, or observed by contemporary techniques.

Because each transgenic line may represent a
different site of integration of the transgene, and a
potentially different tandem array of transgene inserts, and
because each different configuration of transgene and flanking
DNA sequences can affect gene expression, it is preferable to
identify and use lines of mice that express high levels of
human immunoglobulins, particularly of the IgG isotype, and
contain the least number of copies of the transgene. Single
copy transgenics minimize the potential problem of incomplete
allelic expression. Transgenes are typically integrated into
host chromosomal DNA, most usually into germline DNA and
propagated by subsequent breeding of germline transgenic
breeding stock animals. However, other vectors and transgenic
methods known in the present art or subsequently developed may
be substituted as appropriate and as desired by a
practitioner.

G. Functional Disruption of
Endogenous Immunoglobulin Loci

The expression of successfully rearranged
immunoglobulin heavy and light transgenes is expected to have
a dominant effect by suppressing the rearrangement of the
endogenous immunoglobulin genes in the transgenic nonhuman
animal. However, another way to generate a nonhuman animal that is
devoid of endogenous antibodies is by mutating the endogenous
immunoglobulin loci. Using embryonic stem cell technology and
homologous recombination, the endogenous immunoglobulin
repertoire can be readily eliminated. The following describes
the functional disruption of the mouse immunoglobulin loci.
The vectors and methods disclosed, however, can be readily
adapted for use in other non-human animals.

Briefly, this technology involves the inactivation
of a gene, by homologous recombination, in a pluripotent cell
line that is capable of differentiating into germ cell tissue.
A DNA construct that contains an altered copy of a mouse
immunoglobulin gene is introduced into the nuclei of embryonic
stem cells. In a portion of the cells, the introduced DNA
recombines with the endogenous copy of the mouse gene,
replacing it with the altered copy. Cells containing the
newly engineered genetic lesion are injected into a host mouse
embryo, which is reimplanted into a recipient female. Some of
these embryos develop into chimeric mice that possess germ
cells entirely derived from the mutant cell line. Therefore,
by breeding the chimeric mice it is possible to obtain a new
line of mice containing the introduced genetic lesion
(reviewed by Capecchi (1989), Science, 244, 1288-1292).

Because the mouse λ locus contributes to only 5% of
the immunoglobulins, inactivation of the heavy chain and/or
κ-light chain loci is sufficient. There are three ways to
disrupt each of these loci: deletion of the J region, deletion
of the J-C intron enhancer, and disruption of constant region
coding sequences by the introduction of a stop codon. The
last option is the most straightforward, in terms of DNA
construct design. Elimination of the μ gene disrupts B-cell
maturation, thereby preventing class switching to any of the
functional heavy chain segments. The strategy for knocking
out these loci is outlined below.

To disrupt the mouse μ and κ genes, targeting
vectors are used based on the design employed by Jaenisch and
co-workers (Zijlstra, et al. (1989), Nature, 342, 435-438) for
the successful disruption of the mouse β2-microglobulin gene.
The neomycin resistance gene (neo), from the plasmid pMC1neo,
is inserted into the coding region of the target gene. The
pMC1neo insert uses a hybrid viral promoter/enhancer sequence
to drive neo expression. This promoter is active in embryonic
stem cells. Therefore, neo can be used as a selectable marker
for integration of the knock-out construct. The HSV thymidine
kinase (tk) gene is added to the end of the construct as a
negative selection marker against random insertion events
(Zijlstra, et al., supra).

A preferred strategy for disrupting the heavy chain
locus is the elimination of the J region. This region is
fairly compact in the mouse, spanning only 1.3 kb. To
construct a gene targeting vector, a 15 kb KpnI fragment
containing all of the secreted μ constant region exons from
a mouse genomic library is isolated. The 1.3 kb J region is
replaced with the 1.1 kb insert from pMC1neo. The HSV tk gene
is then added to the 5' end of the KpnI fragment. Correct
integration of this construct, via homologous recombination,
will result in the replacement of the mouse J H region with the
neo gene. Recombinants are screened by PCR, using a primer
based on the neo gene and a primer homologous to mouse
sequences 5' of the KpnI site in the D region.

Alternatively, the heavy-chain locus is knocked out
by disrupting the coding region of the μ gene. This approach
involves the same 15 kb KpnI fragment used in the previous
approach. The 1.1 kb insert from pMC1neo is inserted at a
unique BamHI site in exon II, and the HSV tk gene is added to the
3' KpnI end. Double crossover events on either side of the
neo insert, which eliminate the tk gene, are then selected for.
These are detected from pools of selected clones by PCR
amplification. One of the PCR primers is derived from neo
sequences and the other from mouse sequences outside of the
targeting vector. The functional disruption of the mouse
immunoglobulin loci is presented in the Examples.



G. Suppressing Expression of
Endogenous Immunoglobulin Loci



In addition to functional disruption of endogenous
Ig loci, an alternative method for preventing the expression
of an endogenous Ig locus is suppression. Suppression of
endogenous Ig genes may be accomplished with antisense RNA
produced from one or more integrated transgenes, by antisense
oligonucleotides, and/or by administration of antisera
specific for one or more endogenous Ig chains.
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Polynucleotides 
Antisense RNA transgenes can be employed to
partially or totally knock out expression of specific genes
(Pepin et al. (1991) Nature 355: 725; Helene, C. and Toulme,
J. (1990) Biochimica Biophys. Acta 1049: 99; Stout, J. and
Caskey, T. (1990) Somat. Cell Mol. Genet. 16: 369; Munir et
al. (1990) Somat. Cell Mol. Genet. 16: 383, each of which is
incorporated herein by reference).

"Antisense polynucleotides" are polynucleotides
that: (1) are complementary to all or part of a reference
sequence, such as a sequence of an endogenous Ig or 
region, and (2) which specifically hybridize to a 
complementary target sequence, such as a chromosomal gene
locus or an Ig mRNA. Such complementary antisense
polynucleotides may include nucleotide substitutions,
additions, deletions, or transpositions, so long as specific
hybridization to the relevant target sequence is retained as a
functional property of the polynucleotide. Complementary
antisense polynucleotides include soluble antisense RNA or DNA
oligonucleotides which can hybridize specifically to
individual mRNA species and prevent transcription and/or RNA
processing of the mRNA species and/or translation of the
encoded polypeptide (Ching et al., Proc. Natl. Acad. Sci.
U.S.A. 86:10006-10010 (1989); Broder et al., Ann. Int. Med.
113:604-618 (1990); Loreau et al., FEBS Letters 274:53-56
(1990); Holcenberg et al., WO91/11535; WO91/09865; WO91/04753;
WO90/13641; and EP 386563, each of which is incorporated
herein by reference). An antisense sequence is a
polynucleotide sequence that is complementary to at least one
immunoglobulin gene sequence of at least about 15 contiguous
nucleotides in length, typically at least 20 to 30 nucleotides
in length, and preferably more than about 30 nucleotides in
length. However, in some embodiments, antisense sequences may
have substitutions, additions, or deletions as compared to the
complementary immunoglobulin gene sequence, so long as
specific hybridization is retained as a property of the
antisense polynucleotide. Generally, an antisense sequence is
complementary to an endogenous immunoglobulin gene sequence
that encodes, or has the potential to encode after DNA
rearrangement, an immunoglobulin chain. In some cases, sense
sequences corresponding to an immunoglobulin gene sequence may
function to suppress expression, particularly by interfering
with transcription.
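
The sequence-level part of this definition (complementarity over at least about 15 contiguous nucleotides, with limited substitutions tolerated) can be illustrated with a short sketch. It is only an approximation under stated assumptions: the function names are hypothetical, the 15-nucleotide default comes from the text, the mismatch allowance is an arbitrary stand-in for "specific hybridization", and additions or deletions are not modelled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_antisense_candidate(candidate: str, target: str,
                           min_len: int = 15,
                           max_mismatches: int = 0) -> bool:
    """Check whether `candidate` is complementary to some window of `target`.

    min_len reflects the "at least about 15 contiguous nucleotides" in the
    text; max_mismatches is a hypothetical knob standing in for tolerated
    substitutions. Real hybridization specificity depends on conditions
    that this string comparison does not capture.
    """
    candidate = candidate.upper()
    target = target.upper()
    if len(candidate) < min_len:
        return False
    # The stretch of the target strand that the candidate would pair with.
    sense = reverse_complement(candidate)
    for start in range(len(target) - len(sense) + 1):
        window = target[start:start + len(sense)]
        mismatches = sum(a != b for a, b in zip(sense, window))
        if mismatches <= max_mismatches:
            return True
    return False
```

Raising max_mismatches loosens the check in the direction the text allows for substituted antisense sequences; any value chosen here is illustrative rather than taken from the disclosure.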
The antisense polynucleotides therefore inhibit
production of the encoded polypeptide(s). In this regard,
antisense polynucleotides that inhibit transcription and/or
translation of one or more endogenous Ig loci can alter the
capacity and/or specificity of a non-human animal to produce
immunoglobulin chains encoded by endogenous Ig loci.

Antisense polynucleotides may be produced from a
heterologous expression cassette in a transfectant cell or
transgenic cell, such as a transgenic pluripotent
hematopoietic stem cell used to reconstitute all or part of
the hematopoietic stem cell population of an individual, or a
transgenic nonhuman animal. Alternatively, the antisense
polynucleotides may comprise soluble oligonucleotides that are
administered to the external milieu, either in culture medium
in vitro or in the circulatory system or interstitial fluid in
vivo. Soluble antisense polynucleotides present in the
external milieu have been shown to gain access to the
cytoplasm and inhibit translation of specific mRNA species. In
some embodiments the antisense polynucleotides comprise
methylphosphonate moieties; alternatively, phosphorothioates
or O-methylribonucleotides may be used, and chimeric
oligonucleotides may also be used (Dagle et al. (1990) Nucleic
Acids Res. 18: 4751). For some applications, antisense
oligonucleotides may comprise polyamide nucleic acids (Nielsen
et al. (1991) Science 254: 1497). For general methods
relating to antisense polynucleotides, see Antisense RNA and
DNA, (1988), D. A. Melton, Ed., Cold Spring Harbor Laboratory,
Cold Spring Harbor, NY.

Antisense polynucleotides complementary to one or
more sequences are employed to inhibit transcription, RNA
processing, and/or translation of the cognate mRNA species and
thereby effect a reduction in the amount of the respective
encoded polypeptide. Such antisense polynucleotides can
provide a therapeutic function by inhibiting the formation of
one or more endogenous Ig chains in vivo.

Whether as soluble antisense oligonucleotides or as
antisense RNA transcribed from an antisense transgene, the
antisense polynucleotides of this invention are selected so as
to hybridize preferentially to endogenous Ig sequences at
physiological conditions in vivo. Most typically, the
selected antisense polynucleotides will not appreciably
hybridize to heterologous Ig sequences encoded by a heavy or
light chain transgene of the invention (i.e., the antisense
oligonucleotides will not inhibit transgene Ig expression by
more than about 25 to 35 percent).

Antiserum Suppression
Partial or complete suppression of endogenous Ig
chain expression can be produced by injecting mice with
antisera against one or more endogenous Ig chains (Weiss et
al. (1984) Proc. Natl. Acad. Sci. (U.S.A.) 81: 211, which is
incorporated herein by reference). Antisera are selected so
as to react specifically with one or more endogenous Ig chains
but to have minimal or no cross-reactivity with heterologous
Ig chains encoded by an Ig transgene of the invention. Thus,
administration of selected antisera according to a schedule as
typified by that of Weiss et al., op. cit., will suppress
endogenous Ig chain expression but permit expression of
heterologous Ig chain(s) encoded by a transgene of the present
invention.



Nucleic Acids 

For nucleic acids, the term "substantial homology"
indicates that two nucleic acids, or designated sequences
thereof, when optimally aligned and compared, are identical,
with appropriate nucleotide insertions or deletions, in at
least about 80% of the nucleotides, usually at least about 90%
to 95%, and more preferably at least about 98 to 99.5% of the
nucleotides. Alternatively, substantial homology exists when
the segments will hybridize under selective hybridization
conditions, to the complement of the strand. The nucleic
acids may be present in whole cells, in a cell lysate, or in a
partially purified or substantially pure form. A nucleic acid
is "isolated" or "rendered substantially pure" when purified
away from other cellular components or other contaminants,
e.g., other cellular nucleic acids or proteins, by standard
techniques, including alkaline/SDS treatment, CsCl banding,
column chromatography, agarose gel electrophoresis and others
well known in the art. See F. Ausubel, et al., ed. Current
Protocols in Molecular Biology, Greene Publishing and Wiley-
Interscience, New York (1987).
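
The percentage thresholds given above for "substantial homology" can be applied to two sequences that have already been optimally aligned. The sketch below is illustrative only: the function names are hypothetical, gaps are written as "-", and counting a gapped column as a non-identical position is an assumption, since the text does not say how insertions or deletions enter the percentage.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same base.

    Both inputs must come from the same alignment and therefore have equal
    length. Columns that are gaps in both sequences are ignored; a gap
    opposite a base counts as a mismatch (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def homology_tier(pct: float) -> str:
    """Map a percent identity onto the tiers named in the text."""
    if pct >= 98.0:
        return "at least about 98%"
    if pct >= 90.0:
        return "at least about 90%"
    if pct >= 80.0:
        return "at least about 80%"
    return "below 80%"
```

The alternative, hybridization-based definition in the same paragraph is an experimental criterion and is not captured by this calculation.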

The nucleic acid compositions of the present
invention, while often in a native sequence (except for
modified restriction sites and the like), from either cDNA,
genomic or mixtures thereof, may be mutated in accordance with
standard techniques to provide gene sequences. For coding
sequences, these mutations may affect amino acid sequence as
desired. In particular, DNA sequences substantially
homologous to or derived from native V, D, J, constant,
switches and other such sequences described herein are
contemplated (where "derived" indicates that a sequence is
identical or modified from another sequence).

A nucleic acid is "operably linked" when it is
placed into a functional relationship with another nucleic
acid sequence. For instance, a promoter or enhancer is
operably linked to a coding sequence if it affects the
transcription of the sequence. With respect to transcription
regulatory sequences, operably linked means that the DNA
sequences being linked are contiguous and, where necessary to
join two protein coding regions, contiguous and in reading
frame. For switch sequences, operably linked indicates that
the sequences are capable of effecting switch recombination.

Specific Preferred Embodiments 

A preferred embodiment of the invention is an animal
containing at least one, typically 2-10, and sometimes 25-50
or more copies of the transgene described in Example 12 (e.g.,
pHC1 or pHC2) bred with an animal containing a single copy of
a light chain transgene described in Examples 5, 6, 8, or 14,
and the offspring bred with the J H deleted animal described in
Example 10. Animals are bred to homozygosity for each of
these three traits. Such animals have the following genotype:
a single copy (per haploid set of chromosomes) of a human
heavy chain unrearranged mini-locus (described in Example 12),
a single copy (per haploid set of chromosomes) of a rearranged
human κ light chain construct (described in Example 14), and a
deletion at each endogenous mouse heavy chain locus that
removes all of the functional J H segments (described in
Example 10). Such animals are bred with mice that are
homozygous for the deletion of the J H segments (Example 10)
to produce offspring that are homozygous for the J H deletion
and hemizygous for the human heavy and light chain constructs.
The resultant animals are injected with antigens and used for
production of human monoclonal antibodies against these
antigens.

B cells isolated from such an animal are
monospecific with regard to the human heavy and light chains
because they contain only a single copy of each gene.
Furthermore, they will be monospecific with regard to human
or mouse heavy chains because both endogenous mouse heavy
chain gene copies are nonfunctional by virtue of the deletion
spanning the J H region introduced as described in Examples 9
and 12. Furthermore, a substantial fraction of the B cells
will be monospecific with regard to the human or mouse light
chains because expression of the single copy of the rearranged
human κ light chain gene will allelically and isotypically
exclude the rearrangement of the endogenous mouse κ and λ
chain genes in a significant fraction of B-cells.

The transgenic mouse of the preferred embodiment
will exhibit immunoglobulin production with a significant
repertoire, ideally substantially similar to that of a native
mouse. Thus, for example, in embodiments where the endogenous
Ig genes have been inactivated, the total immunoglobulin
levels will range from about 0.1 to 10 mg/ml of serum,
preferably 0.5 to 5 mg/ml, ideally at least about 1.0 mg/ml.
When a transgene capable of effecting a switch to IgG from IgM
has been introduced into the transgenic mouse, the adult mouse
ratio of serum IgG to IgM is preferably about 10:1. Of
course, the IgG to IgM ratio will be much lower in the
immature mouse. In general, greater than about 10%,
preferably 40 to 80% of the spleen and lymph node B cells
express exclusively human IgG protein.

The repertoire will ideally approximate that shown
in a non-transgenic mouse, usually at least about 10% as high,
preferably 25 to 50% or more. Generally, at least about a
thousand different immunoglobulins (ideally IgG), preferably
10^4 to 10^6 or more, will be produced, depending primarily on
the number of different V, J and D regions introduced into the
mouse genome. These immunoglobulins will typically recognize
about one-half or more of highly antigenic proteins,
including, but not limited to: pigeon cytochrome C, chicken
lysozyme, pokeweed mitogen, bovine serum albumin, keyhole
limpet hemocyanin, influenza hemagglutinin, staphylococcus
protein A, sperm whale myoglobin, influenza neuraminidase, and
lambda repressor protein. Some of the immunoglobulins will
exhibit an affinity for preselected antigens of at least about
10^7 M^-1, preferably 10^8 M^-1 to 10^9 M^-1 or greater.
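
The affinities above are association constants in units of M^-1. For readers used to dissociation constants, the conversion is simply the reciprocal, so 10^7 M^-1 corresponds to 100 nM and 10^9 M^-1 to 1 nM. A minimal sketch of that conversion follows; the function names and tier labels are hypothetical and only restate the figures in the text.

```python
def dissociation_constant_molar(ka_per_molar: float) -> float:
    """Convert an association constant (M^-1) to a dissociation constant (M)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1.0 / ka_per_molar


def affinity_tier(ka_per_molar: float) -> str:
    """Classify an association constant against the figures in the text."""
    if ka_per_molar >= 1e8:
        return "preferred (10^8 M^-1 or greater)"
    if ka_per_molar >= 1e7:
        return "meets minimum (at least about 10^7 M^-1)"
    return "below stated minimum"


if __name__ == "__main__":
    for ka in (1e7, 1e8, 1e9):
        kd_nm = dissociation_constant_molar(ka) * 1e9  # molar to nanomolar
        print(f"Ka = {ka:.0e} M^-1  ->  Kd ~ {kd_nm:g} nM  ({affinity_tier(ka)})")
```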

Thus, prior to rearrangement of a transgene
containing various heavy or light chain gene segments, such
gene segments may be readily identified, e.g. by hybridization
or DNA sequencing, as being from a species of organism other
than the transgenic animal.

Although the foregoing describes a preferred 
embodiment of the transgenic animal of the invention, other 
embodiments are defined by the disclosure herein and more 
particularly by the transgenes described in the Examples. 

Four categories of transgenic animal may be defined:

I. Transgenic animals containing an unrearranged heavy
and rearranged light immunoglobulin transgene,

II. Transgenic animals containing an unrearranged heavy
and unrearranged light immunoglobulin transgene,

III. Transgenic animals containing a rearranged heavy and an
unrearranged light immunoglobulin transgene, and

IV. Transgenic animals containing rearranged heavy and
rearranged light immunoglobulin transgenes.

Of these categories of transgenic animal, the
order of preference is as follows: II > I > III > IV
where the endogenous light chain genes (or at least the κ
gene) have been knocked out by homologous recombination (or
other method) and I > II > III > IV where the endogenous light
chain genes have not been knocked out and must be dominated by
allelic exclusion.

EXPERIMENTAL EXAMPLES

METHODS AND MATERIALS

Transgenic mice are derived according to Hogan, et
al., "Manipulating the Mouse Embryo: A Laboratory Manual",
Cold Spring Harbor Laboratory, which is incorporated herein by
reference.

Embryonic stem cells are manipulated according to
published procedures (Teratocarcinomas and embryonic stem
cells: a practical approach, E.J. Robertson, ed., IRL Press,
Washington, D.C., 1987; Zijlstra et al., Nature 342:435-438
(1989); and Schwartzberg et al., Science 246:799-803 (1989),
each of which is incorporated herein by reference).

DNA cloning procedures are carried out according to
J. Sambrook, et al. in Molecular Cloning: A Laboratory
Manual, 2d ed., 1989, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, N.Y., which is incorporated herein by
reference.

Oligonucleotides are synthesized on an Applied
Biosystems oligonucleotide synthesizer according to
specifications provided by the manufacturer.

Hybridoma cells and antibodies are manipulated
according to "Antibodies: A Laboratory Manual", Ed Harlow and
David Lane, Cold Spring Harbor Laboratory (1988), which is
incorporated herein by reference.

EXAMPLE 1

Genomic Heavy Chain Human Ig Transgene

This Example describes the cloning of a human genomic
heavy chain immunoglobulin transgene and its
microinjection into a murine zygote.

Nuclei are isolated from fresh human placental
tissue as described by Marzluff et al., "Transcription and
Translation: A Practical Approach", B.D. Hames and
S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford (1985).
The isolated nuclei (or PBS washed human spermatocytes) are
embedded in a low melting point agarose matrix and lysed with
EDTA and proteinase K to expose high molecular weight DNA,
which is then digested in the agarose with the restriction
enzyme NotI as described by M. Finney in Current Protocols in
Molecular Biology (F. Ausubel, et al., eds. John Wiley & Sons,
Supp. 4, 1988, Section 2.5.1).

The NotI digested DNA is then fractionated by pulsed
field gel electrophoresis as described by Anand et al.,
Nucl. Acids Res. 17:3425-3433 (1989). Fractions enriched for
the NotI fragment are assayed by Southern hybridization to
detect one or more of the sequences encoded by this fragment.
Such sequences include the heavy chain D segments, J segments,
μ and γ1 constant regions together with representatives of all
6 V H families (although this fragment is identified as a 670 kb
fragment from HeLa cells by Berman et al. (1988), supra, we
have found it to be an 830 kb fragment from human placental and
sperm DNA). Those fractions containing this NotI fragment
(see Fig. 4) are pooled and cloned into the NotI site of the
vector pYACNN in yeast cells. Plasmid pYACNN is prepared by
digestion of pYAC-4 Neo (Cook et al., Nucleic Acids Res. 16:
11817 (1988)) with EcoRI and ligation in the presence of the
oligonucleotide 5'- AAT TGC GGC CGC -3'.
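
The linker oligonucleotide can be inspected in a few lines of code. The sketch below checks two properties that follow from its sequence: it contains the NotI recognition site GCGGCCGC, and the portion after the first four bases is its own reverse complement. One reading of this, which is an assumption and not stated in the text, is that two copies of the oligonucleotide anneal to each other and leave 5' AATT overhangs compatible with EcoRI-cut ends. The helper names are hypothetical.

```python
from typing import Optional

COMPLEMENT = str.maketrans("ACGT", "TGCA")

LINKER = "AATTGCGGCCGC"      # oligonucleotide given in the text
NOTI_SITE = "GCGGCCGC"       # NotI recognition sequence
ECORI_OVERHANG = "AATT"      # 5' overhang left by EcoRI digestion


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def self_annealed_overhang(oligo: str, overhang_len: int = 4) -> Optional[str]:
    """Return the 5' overhang left when two copies of `oligo` pair, else None.

    Two copies can pair through the region after the overhang only if that
    region is palindromic, i.e. equal to its own reverse complement.
    """
    core = oligo[overhang_len:]
    if core and core == reverse_complement(core):
        return oligo[:overhang_len]
    return None


if __name__ == "__main__":
    print("contains NotI site:", NOTI_SITE in LINKER)
    print("overhang after self-annealing:", self_annealed_overhang(LINKER))
```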

YAC clones containing the heavy chain NotI fragment
are isolated as described by Brownstein et al., Science
244:1348-1351 (1989), and Green et al., Proc. Natl. Acad. Sci.
USA 87:1213-1217 (1990), which are incorporated herein by
reference. The cloned NotI insert is isolated from high
molecular weight yeast DNA by pulsed field gel electrophoresis
as described by M. Finney, op. cit. The DNA is condensed by
the addition of 1 mM spermine and microinjected directly into
the nucleus of single cell embryos as previously described.






EXAMPLE 2

Genomic κ Light Chain Human Ig Transgene
Formed by In Vivo Homologous Recombination

A map of the human κ light chain has been described
in Lorenz et al., Nucl. Acids Res. 15:9667-9677 (1987), which
is incorporated herein by reference.

A 450 kb XhoI to NotI fragment that includes all of
Cκ, the 3' enhancer, all J segments, and at least five
different V segments is isolated and microinjected into the
nucleus of single cell embryos as described in Example 1.

EXAMPLE 3

Genomic κ Light Chain Human Ig Transgene
Formed by In Vivo Homologous Recombination

A 750 kb MluI to NotI fragment that includes all of
the above plus at least 20 more V segments is isolated as
described in Example 1 and digested with BssHII to produce a
fragment of about 400 kb.

The 450 kb XhoI to NotI fragment plus the
approximately 400 kb MluI to BssHII fragment have sequence
overlap defined by the BssHII and XhoI restriction sites.
Homologous recombination of these two fragments upon
microinjection of a mouse zygote results in a transgene
containing at least an additional 15-20 V segments over that
found in the 450 kb XhoI/NotI fragment (Example 2).
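
The size of the overlap between the two fragments can be estimated from the approximate fragment lengths given in Examples 2 and 3. The sketch below places the MluI site at coordinate 0 and assumes that the 450 kb and 750 kb fragments end at the same NotI site; that shared end point, the coordinate system, and the function names are assumptions made for illustration, and all sizes are approximate.

```python
def site_coordinates_kb(long_fragment_kb: float = 750.0,
                        trimmed_kb: float = 400.0,
                        short_fragment_kb: float = 450.0) -> dict:
    """Place the four restriction sites on a line with MluI at 0 kb.

    Assumes (hypothetically) that both the MluI-NotI and XhoI-NotI
    fragments end at the same NotI site.
    """
    noti = long_fragment_kb
    return {
        "MluI": 0.0,
        "XhoI": noti - short_fragment_kb,  # start of the XhoI-NotI fragment
        "BssHII": trimmed_kb,              # end of the MluI-BssHII fragment
        "NotI": noti,
    }


def overlap_kb(**sizes: float) -> float:
    """Length of sequence shared by the MluI-BssHII and XhoI-NotI fragments."""
    sites = site_coordinates_kb(**sizes)
    return sites["BssHII"] - sites["XhoI"]


if __name__ == "__main__":
    print(site_coordinates_kb())
    print("approximate overlap (kb):", overlap_kb())
```

Under these assumptions the two fragments would share roughly 100 kb between the XhoI and BssHII sites, and recombination within that interval would reconstitute the full MluI to NotI span.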

EXAMPLE 4

Construction of Heavy Chain Mini-Locus
A. Construction of pGP1 and pGP2

pBR322 is digested with EcoRI and StyI and ligated
with the following oligonucleotides to generate pGP1, which
contains a 147 base pair insert containing the restriction
sites shown in Fig. 8. The general overlapping of these
oligos is also shown in Fig. 9.
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The oligonucleotides are: 





oligo-1   5' - CTT GAG CCC GCC TAA TGA GCG GGC TTT
               TTT TTG CAT ACT GCG GCC - 3'

oligo-2   5' - GCA ATG GCC TGG ATC CAT GGC GCG CTA
               GCA TCG ATA TCT AGA GCT CGA GCA - 3'

oligo-3   5' - TGC AGA TCT GAA TTC CCG GGT ACC AAG
               CTT ACG CGT ACT AGT GCG GCC GCT - 3'

oligo-4   5' - AAT TAG CGG CCG CAC TAG TAC GCG TAA
               GCT TGG TAC CCG GGA ATT - 3'

oligo-5   5' - CAG ATC TGC ATG CTC GAG CTC TAG ATA
               TCG ATG CTA GCG CGC CAT GGA TCC - 3'

oligo-6   5' - AGG CCA TTG CGG CCG CAG TAT GCA AAA
               AAA AGC CCG CTC ATT AGG CGG GCT - 3'



This plasmid contains a large polylinker flanked by
rare cutting NotI sites for building large inserts that can be
isolated from vector sequences for microinjection. The
plasmid is based on pBR322 which is relatively low copy
compared to the pUC based plasmids (pGP1 retains the pBR322
copy number control region near the origin of replication).
Low copy number reduces the potential toxicity of insert
sequences. In addition, pGP1 contains a strong transcription
terminator sequence derived from trpA (Christie et al., Proc.
Natl. Acad. Sci. USA 78:4180 (1981)) inserted between the
ampicillin resistance gene and the polylinker. This further
reduces the toxicity associated with certain inserts by
preventing readthrough transcription coming from the
ampicillin promoters.
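
The six oligonucleotides used to build the pGP1 insert can be checked for internal consistency. The sketch below assumes, as an inference from the sequences rather than something the text states, that oligos 1 to 3 form one strand in that order and oligos 4 to 6 form the other. Under that assumption it tests whether the first strand is 147 nucleotides long, whether the second strand is its reverse complement apart from four-base 5' overhangs at each end, and how many NotI sites the insert carries. Helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
NOTI_SITE = "GCGGCCGC"

# Sequences as listed for oligo-1 through oligo-6, written 5' to 3'.
OLIGOS = {
    1: "CTT GAG CCC GCC TAA TGA GCG GGC TTT TTT TTG CAT ACT GCG GCC",
    2: "GCA ATG GCC TGG ATC CAT GGC GCG CTA GCA TCG ATA TCT AGA GCT CGA GCA",
    3: "TGC AGA TCT GAA TTC CCG GGT ACC AAG CTT ACG CGT ACT AGT GCG GCC GCT",
    4: "AAT TAG CGG CCG CAC TAG TAC GCG TAA GCT TGG TAC CCG GGA ATT",
    5: "CAG ATC TGC ATG CTC GAG CTC TAG ATA TCG ATG CTA GCG CGC CAT GGA TCC",
    6: "AGG CCA TTG CGG CCG CAG TAT GCA AAA AAA AGC CCG CTC ATT AGG CGG GCT",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def join(*numbers: int) -> str:
    """Concatenate the named oligos into one strand, dropping the spacing."""
    return "".join(OLIGOS[n].replace(" ", "") for n in numbers)


# Assumed strand assignment: 1-2-3 on one strand, 4-5-6 on the other.
TOP = join(1, 2, 3)
BOTTOM = join(4, 5, 6)


def strands_pair(top: str, bottom: str, overhang_len: int = 4) -> bool:
    """True if the strands are complementary outside their 5' overhangs."""
    return bottom[overhang_len:] == reverse_complement(top[overhang_len:])


if __name__ == "__main__":
    print("top strand length:", len(TOP))
    print("strands pair:", strands_pair(TOP, BOTTOM))
    print("5' overhangs:", TOP[:4], BOTTOM[:4])
    print("NotI sites in top strand:", TOP.count(NOTI_SITE))
```

The two NotI sites found near the ends of the assembled strand are consistent with the description of a polylinker flanked by NotI sites, and the trpA-derived terminator lies within oligo-1.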

Plasmid pGP2 is derived from pGP1 to introduce an
additional restriction site (SfiI) in the polylinker. pGP1 is
digested with MluI and SpeI to cut the recognition sequences
in the polylinker portion of the plasmid.

The following adapter oligonucleotides are ligated
to the thus digested pGP1 to form pGP2.

5' CGC GTG GCC GCA ATG GCC A 3'
5' CTA GTG GCC ATT GCG GCC A 3'

pGP2 is identical to pGP1 except that it contains an
additional SfiI site located between the MluI and SpeI sites.
This allows inserts to be completely excised with SfiI as well
as with NotI.
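
The two adapter oligonucleotides can be checked in the same way. The sketch below tests whether they are complementary outside their first four bases, reports the 5' overhangs that would remain (CGCG matches the overhang left by MluI and CTAG the one left by SpeI), and searches for the SfiI recognition pattern GGCCNNNNNGGCC. The variable and function names are hypothetical, and the check is purely sequence-based.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")

ADAPTER_A = "CGCGTGGCCGCAATGGCCA"   # first adapter oligonucleotide, 5' to 3'
ADAPTER_B = "CTAGTGGCCATTGCGGCCA"   # second adapter oligonucleotide, 5' to 3'

# SfiI recognizes GGCC, any five bases, then GGCC.
SFII_PATTERN = re.compile(r"GGCC[ACGT]{5}GGCC")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def adapter_overhangs(a: str, b: str, overhang_len: int = 4):
    """Return both 5' overhangs if the oligos pair outside them, else None."""
    if a[overhang_len:] == reverse_complement(b[overhang_len:]):
        return a[:overhang_len], b[:overhang_len]
    return None


if __name__ == "__main__":
    print("5' overhangs:", adapter_overhangs(ADAPTER_A, ADAPTER_B))
    match = SFII_PATTERN.search(ADAPTER_A)
    print("SfiI site:", match.group(0) if match else None)
```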

B. Construction of pRE3 (rat enhancer 3')
An enhancer sequence located downstream of the rat
constant region is included in the heavy chain constructs.

The heavy chain region 3' enhancer described by
Petterson et al., Nature 344:165-168 (1990), which is
incorporated herein by reference, is isolated and cloned. The
rat IGH 3' enhancer sequence is PCR amplified by using the
following oligonucleotides:

5' CAG GAT CCA GAT ATC AGT ACC TGA AAC AGG GCT TGC 3'
5' GAG CAT GCA CAG GAC CTG GAG CAC ACA CAG CCT TCC 3'

The thus formed double stranded DNA encoding the 3'
enhancer is cut with BamHI and SphI and cloned into BamHI/SphI
cut pGP2 to yield pRE3 (rat enhancer 3').

C. Cloning of Human J-μ Region

A substantial portion of this region is cloned by
combining two or more fragments isolated from phage lambda
inserts. See Fig. 9.

A 6.3 kb BamHI/Hindlll fragment that includes all 
human J segments (Matsuda et al., EMBO J. , 7:1047-1051 (1988); 
Ravetech et al.m Ceil, 27:583-591 (1981), which are 
incorporated herein by reference) is isolated from human 
30 genomic DMA library using the oligonucleotide GGA CTG TGT CCC 
TGT GTG ATG CTT TTG ATG TCT GGGGCC AAG. • 

An adjacent 10 kb HindIII/BamHI fragment that
contains enhancer, switch and constant region coding exons
(Yasui et al., Eur. J. Immunol. 19:1399-1403 (1989)) is
similarly isolated using the oligonucleotide:

CAC CAA GTT GAC CTG CCT GGT CAC AGA CCT GAC CAC CTA TGA 

An adjacent 3' 1.5 kb BamHI fragment is similarly
isolated using clone pMOM insert as probe (pMOM is a 4 kb
EcoRI/HindIII fragment isolated from human genomic DNA library
with oligonucleotide:

CCT GTG GAC CAC CGC CTC CAC CTT CAT
CGT CCT CTT CCT CCT

mu membrane exon 1) and cloned into pUC19.

pGP1 is digested with BamHI and BglII followed by
treatment with calf intestinal alkaline phosphatase.

Fragments (a) and (b) from Fig. 9 are cloned in the
digested pGP1. A clone is then isolated which is oriented
such that the 5' BamHI site is destroyed by BamHI/BglII fusion.
It is identified as pMU (see Fig. 10). pMU is digested with
BamHI and fragment (c) from Fig. 9 is inserted. The
orientation is checked with HindIII digest. The resultant
plasmid pHIG1 (Fig. 10) contains an 18 kb insert encoding J
and Cμ segments.

D. Cloning of Cμ Region
pGP1 is digested with BamHI and HindIII followed
by treatment with calf intestinal alkaline phosphatase (Fig.
14). The so treated fragment (b) of Fig. 14 and fragment (c)
of Fig. 14 are cloned into the BamHI/HindIII cut pGP1. Proper
orientation of fragment (c) is checked by HindIII digestion to
form pCON1 containing a 12 kb insert encoding the Cμ region.

Whereas pHIG1 contains J segments, switch and μ
sequences in its 18 kb insert with an SfiI 3' site and a SpeI
5' site in a polylinker flanked by NotI sites, pCON1, which
will be used for rearranged VDJ segments, is identical except
that it lacks the J region and contains only a 12 kb insert.
The use of pCON1 in the construction of fragments containing
rearranged VDJ segments will be described hereinafter.

E. Cloning of γ-1 Constant Region (pREG2)

The cloning of the human γ-1 region is depicted in
Fig. 16.

Yamamura et al., Proc. Natl. Acad. Sci. USA
83:2152-2156 (1986) reported the expression of membrane bound
human γ-1 from a transgene construct that had been partially
deleted on integration. Their results indicate that the 3'
BamHI site delineates a sequence that includes the
transmembrane rearranged and switched copy of the gamma gene
with a V-C intron of less than 5 kb. Therefore, in the
unrearranged, unswitched gene, the entire switch region is
included in a sequence beginning less than 5 kb from the 5'
end of the first γ-1 constant exon. Therefore it is included
in the 5' 5.3 kb HindIII fragment (Ellison et al., Nucleic
Acids Res. 10:4071-4079 (1982), which is incorporated herein
by reference). Takahashi et al., Cell 29:671-679 (1982),
which is incorporated herein by reference, also reports that
this fragment contains the switch sequence, and this fragment
together with the 7.7 kb HindIII to BamHI fragment must
include all of the sequences we need for the transgene
construct. An intronic sequence is a nucleotide sequence of
at least 15 contiguous nucleotides that occurs in an intron of
a specified gene.

Phage clones containing the γ-1 region are
identified and isolated using the following oligonucleotide
which is specific for the third exon of γ-1 (CH3).

5' TGA GCC ACG AAG ACC CTG AGG
TCA AGT TCA ACT GGT ACG TGG 3'

A 7.7 kb HindIII to BglII fragment (fragment (a) in
Fig. 11) is cloned into HindIII/BglII cut pRE3 to form pREG1.
The upstream 5.3 kb HindIII fragment (fragment (b) in Fig. 11)
is cloned into HindIII digested pREG1 to form pREG2. Correct
orientation is confirmed by BamHI/SpeI digestion.



F. Combining Cγ and Cμ

The previously described plasmid pHIG1 contains
human J segments and the Cμ constant region exons. To provide
a transgene containing the Cμ constant region gene segments,
pHIG1 was digested with SfiI (Fig. 10). The plasmid pREG2 was
also digested with SfiI to produce a 13.5 kb insert containing
human Cγ exons and the rat 3' enhancer sequence. These
sequences were combined to produce the plasmid pHIG3' (Fig.
12) containing the human J segments, the human Cμ constant
region, the human Cγ1 constant region and the rat 3' enhancer
contained on a 31.5 kb insert.

A second plasmid encoding human Cμ and human Cγ1
without J segments is constructed by digesting pCON1 with SfiI
and combining that with the SfiI fragment containing the human
Cγ region and the rat 3' enhancer by digesting pREG2 with
SfiI. The resultant plasmid, pCON (Fig. 12) contains a 26 kb
NotI/SpeI insert containing human Cμ, human γ1 and the rat 3'
enhancer sequence.

G. Cloning of D Segment

The strategy for cloning the human D segments is
depicted in Fig. 13. Phage clones from the human genomic
library containing D segments are identified and isolated
using probes specific for diversity region sequences (Ichihara
et al., EMBO J. 7:4141-4150 (1988)). The following
oligonucleotides are used:

DXP1: 5' - TGG TAT TAC TAT GGT TCG GGG AGT TAT TAT
AAC CAC AGT GTC - 3'

DXP4: 5' - GCC TGA AAT GGA GCC TCA GGG CAC AGT GGG
CAC GGA CAC TGT - 3'

DN4: 5' - GGA GGG AGG ACA TGT TTA GGA TCT GAG GCC
GCA CCT GAC ACC - 3'

A 5.2 kb XhoI fragment (fragment (b) in Fig. 13)
containing DLR1, DXP1, DXP'1, and DA1 is isolated from a phage
clone identified with oligo DXP1.

A 3.2 kb XbaI fragment (fragment (c) in Fig. 13)
containing DXP4, DA4 and DK4 is isolated from a phage clone
identified with oligo DXP4.

Fragments (b), (c) and (d) from Fig. 13 are combined
and cloned into the XbaI/XhoI site of pGP1 to form pHIG2 which
contains a 10.6 kb insert.

This cloning is performed sequentially. First, the
5.2 kb fragment (b) in Fig. 13 and the 2.2 kb fragment (d) of
Fig. 13 are treated with calf intestinal alkaline phosphatase
and cloned into pGP1 digested with XhoI and XbaI. The
resultant clones are screened with the 5.2 and 2.2 kb insert.
Half of those clones testing positive with the 5.2 and 2.2 kb
inserts have the 5.2 kb insert in the proper orientation as
determined by BamHI digestion. The 3.2 kb XbaI fragment from
Fig. 13 is then cloned into this intermediate plasmid
containing fragments (b) and (d) to form pHIG2. This plasmid
contains diversity segments cloned into the polylinker with a
unique 5' SfiI site and unique 3' SpeI site. The entire
polylinker is flanked by NotI sites.



H. Construction of Heavy Chain Minilocus

The following describes the construction of a human
heavy chain mini-locus which contains one or more V segments.

An unrearranged V segment corresponding to that
identified as the V segment contained in the hybridoma
of Newkirk et al., J. Clin. Invest., 81:1511-1518 (1988), which
is incorporated herein by reference, is isolated using the
following oligonucleotide:

5' - GAT CCT GGT TTA GTT AAA GAG GAT TTT
ATT CAC CCC TGT GTC - 3'

A restriction map of the unrearranged V segment is
determined to identify unique restriction sites which provide
upon digestion a DNA fragment having a length approximately 2
kb containing the unrearranged V segment together with 5' and
3' flanking sequences. The 5' sequences will include
promoter and other regulatory sequences whereas the 3'
flanking sequence provides recombination sequences necessary
for V-DJ joining. This approximately 3.0 kb V segment insert
is cloned into the polylinker of pGB2 to form pVH1.

pVH1 is digested with SfiI and the resultant
fragment is cloned into the SfiI site of pHIG2 to form
pHIG5'. Since pHIG2 contains D segments only, the resultant
pHIG5' plasmid contains a single V segment together with D
segments. The size of the insert contained in pHIG5' is 10.6
kb plus the size of the V segment insert.

The insert from pHIG5' is excised by digestion with
NotI and SpeI and isolated. pHIG3' which contains J, Cμ and
Cγ1 segments is digested with SpeI and NotI and the 3' kb
fragment containing such sequences and the rat 3' enhancer
sequence is isolated. These two fragments are combined and
ligated into NotI digested pGP1 to produce pHIG which contains
an insert encoding a V segment, nine D segments, six functional
J segments, Cμ, Cγ and the rat 3' enhancer. The size of this
insert is approximately 43 kb plus the size of the V segment
insert.
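
The insert sizes quoted in sections F through H can be cross-checked by simple addition. The sketch below uses only the fragment sizes stated in the text; the function and constant names are hypothetical, and the V segment size is left as a parameter because the text gives it only approximately.

```python
# Fragment sizes in kb, as stated in the text.
D_FRAGMENTS_KB = [5.2, 3.2, 2.2]   # fragments (b), (c), (d) of Fig. 13
J_CMU_INSERT_KB = 18.0             # pHIG1 insert (J segments and C-mu)
CGAMMA_ENHANCER_KB = 13.5          # pREG2 SfiI insert (C-gamma, rat 3' enhancer)


def phig2_insert_kb() -> float:
    """Size of the D segment insert of pHIG2."""
    return round(sum(D_FRAGMENTS_KB), 1)


def phig3_insert_kb() -> float:
    """Size of the J / C-mu / C-gamma1 / enhancer insert of pHIG3'."""
    return round(J_CMU_INSERT_KB + CGAMMA_ENHANCER_KB, 1)


def phig_insert_kb(v_segment_kb: float) -> float:
    """Size of the assembled pHIG minilocus insert for a given V insert."""
    return round(phig2_insert_kb() + phig3_insert_kb() + v_segment_kb, 1)


if __name__ == "__main__":
    print("pHIG2 insert:", phig2_insert_kb(), "kb")
    print("pHIG3' insert:", phig3_insert_kb(), "kb")
    for v in (0.0, 2.0, 3.0):
        print(f"pHIG insert with {v} kb V segment:", phig_insert_kb(v), "kb")
```

Without a V segment the sum comes to 42.1 kb, in line with the "approximately 43 kb plus the size of the V segment insert" given above.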

I. Construction of Heavy Chain Minilocus
by Homologous Recombination

As indicated in the previous section, the insert of
pHIG is approximately 43 to 45 kb when a single V segment is
employed. This insert size is at or near the limit of that
which may be readily cloned into plasmid vectors. In order to
provide for the use of a greater number of V segments, the
following describes in vivo homologous recombination of
overlapping DNA fragments which upon homologous recombination
within a zygote or ES cell form a transgene containing the rat
3' enhancer sequence, the human Cμ, the human Cγ1, human J
segments, human D segments and a multiplicity of human V
segments.

A 6.3 kb BamHI/HindIII fragment containing human J
segments (see fragment (a) in Fig. 9) is cloned into MluI/SpeI
digested pHIG5' using the following adapters:

5' GAT CCA AGC AGT 3'
5' CTA GAC TGC TTG 3'

5' CGC GTC GAA CTA 3'
5' AGC TTA GTT CGA 3'

The resultant plasmid is designated pHIG5'O
(overlap). The insert contained in this plasmid contains
human V, D and J segments. When the single V segment from
pVH1 is used, the size of this insert is approximately 17 kb
plus 2 kb. This insert is isolated and combined with the
insert from pHIG3' which contains the human J, Cμ, γ1 and rat
3' enhancer sequences. Both inserts contain human J segments
which provide for approximately 6.3 kb of overlap between the
two DNA fragments. When coinjected into the mouse zygote, in
vivo homologous recombination occurs generating a transgene
equivalent to the insert contained in pHIG.
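
The effect of the shared J region can be illustrated with a toy model. The sketch below is a string analogy only, not a model of the recombination mechanism: two fragments that share a terminal overlap are merged into one, and the expected size of the product is computed from the sizes given in the text (about 19 kb and 31.5 kb, with about 6.3 kb of overlap). The single-letter "sequences" and all function names are hypothetical.

```python
def merge_by_overlap(left: str, right: str, min_overlap: int = 3) -> str:
    """Join two strings at the longest suffix of left that prefixes right."""
    for k in range(min(len(left), len(right)), min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    raise ValueError("fragments share no sufficient overlap")


def recombined_size_kb(left_kb: float, right_kb: float, overlap_kb: float) -> float:
    """Size of the product when the overlap is counted only once."""
    return round(left_kb + right_kb - overlap_kb, 1)


if __name__ == "__main__":
    # Toy fragments: V, D, J versus J, C-mu (M), C-gamma (G), enhancer (E).
    v_d_j = "VVVDDDJJJJ"
    j_c_e = "JJJJMMMGGGE"
    print(merge_by_overlap(v_d_j, j_c_e))
    print(recombined_size_kb(17.0 + 2.0, 31.5, 6.3), "kb")
```

With the sizes above the product is about 44.2 kb, which falls within the 43 to 45 kb range stated for the pHIG insert.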

This approach provides for the addition of a
multiplicity of V segments into the transgene formed in vivo.
For example, instead of incorporating a single V segment into
pHIG5', a multiplicity of V segments contained on (1) isolated
genomic DNA, (2) ligated DNA derived from genomic DNA, or (3)
DNA encoding a synthetic V segment repertoire is cloned into
pHIG2 at the SfiI site to generate pHIG5' VN. The J segments
fragment (a) of Fig. 9 is then cloned into pHIG5' VN and the
insert isolated. This insert now contains a multiplicity of V
segments and J segments which overlap with the J segments
contained on the insert isolated from pHIG3'. When
cointroduced into the nucleus of a mouse zygote, homologous
recombination occurs to generate in vivo the transgene
encoding multiple V segments and multiple J segments, multiple
D segments, the Cμ region, the Cγ1 region (all from human) and
the rat 3' enhancer sequence.

EXAMPLE 5

Construction of Light Chain Minilocus

A. Construction of pEμ1
The construction of pEμ1 is depicted in Fig. 16.
The mouse heavy chain enhancer is isolated on the XbaI to
EcoRI 678 bp fragment (Banerji et al., Cell 33:729-740 (1983))
from phage clones using oligo:

5' GAA TGG GAG TGA GGC TCT CTC ATA CCC
TAT TCA GAA CTG ACT 3'

This Eμ fragment is cloned into EcoRV/XbaI digested
pGP1 by blunt end filling in EcoRI site. The resultant
plasmid is designated pEμ1.

B. Construction of κ Light Chain Minilocus

The κ construct contains at least one human Vκ
segment, all five human Jκ segments, the human J-Cκ enhancer,
human κ constant region exon, and, ideally, the human 3' κ
enhancer (Meyer et al., EMBO J. 8:1959-1964 (1989)). The κ
enhancer in mouse is 9 kb downstream from Cκ. However, it is
as yet unidentified in the human. In addition, the construct
contains a copy of the mouse heavy chain J-Cμ enhancers.

The minilocus is constructed from four component
fragments:

(a) A 16 kb SmaI fragment that contains the human
Cκ exon and the 3' human enhancer by analogy with the mouse
locus;

(b) A 5' adjacent 5 kb SmaI fragment, which
contains all five J segments;

(c) The mouse heavy chain intronic enhancer
isolated from pEμ1 (this sequence is included to induce
expression of the light chain construct as early as possible
in B-cell development. Because the heavy chain genes are
transcribed earlier than the light chain genes, this heavy
chain enhancer is presumably active at an earlier stage than
the intronic κ enhancer); and

(d) A fragment containing one or more V segments.
The preparation of this construct is as follows.
Human placental DNA is digested with SmaI and fractionated on
agarose gel by electrophoresis. Similarly, human placental
DNA is digested with BamHI and fractionated by
electrophoresis. The 16 kb fraction is isolated from the SmaI
digested gel and the 11 kb region is similarly isolated from
the gel containing DNA digested with BamHI.

The 16 kb SmaI fraction is cloned into Lambda FIX II
(Stratagene, La Jolla, California) which has been digested
with XhoI and treated with Klenow fragment DNA polymerase to
fill in the XhoI restriction digest product. Ligation of the
16 kb SmaI fraction destroys the SmaI sites and leaves XhoI
sites intact.

The 11 kb BamHI fraction is cloned into λ EMBL3
(Stratagene, La Jolla, California) which is digested with
BamHI prior to cloning.

Clones from each library were probed with the Cκ
specific oligo:

5' GAA CTG TGG CTG CAC CAT CTG TCT
TCA TCT TCC CGC CAT CTG 3'



A 16 kb XhoI insert was subcloned into the XhoI
cut pEμ1 so that Cκ is adjacent to the SmaI site. The
resultant plasmid was designated pKap1.

The above Cκ specific oligonucleotide is used to
probe the λ EMBL3/BamHI library to identify an 11 kb clone. A
5 kb SmaI fragment (fragment (b) in Fig. 20) is subcloned and
subsequently inserted into pKap1 digested with SmaI. Those
plasmids containing the correct orientation of J segments, Cκ
and the Eμ enhancer are designated pKap2.

One or more Vκ segments are thereafter subcloned
into the MluI site of pKap2 to yield the plasmid pKapH which
encodes the human Vκ segments, the human Jκ segments, the
human Cκ segments and the human Eμ enhancer. This insert is
excised by digesting pKapH with NotI and purified by agarose
gel electrophoresis. The thus purified insert is
microinjected into the pronucleus of a mouse zygote as
previously described.

C. Construction of κ Light Chain Minilocus by
In Vivo Homologous Recombination

The 11 kb BamHI fragment is cloned into BamHI
digested pGP1 such that the 3' end is toward the SfiI site.
The resultant plasmid is designated pKAPint. One or more Vκ
segments is inserted into the polylinker between the BamHI and
SpeI sites in pKAPint to form pKapHV. The insert of pKapHV is
excised by digestion with NotI and purified. The insert from
pKap2 is excised by digestion with NotI and purified. Each of
these fragments contains regions of homology in that the
fragment from pKapHV contains a 5 kb sequence of DNA that
includes the Jκ segments which is substantially homologous to
the 5 kb SmaI fragment contained in the insert obtained from
pKap2. As such, these inserts are capable of homologously
recombining when microinjected into a mouse zygote to form a
transgene encoding Vκ, Jκ and Cκ.

EXAMPLE 6

Isolation of Genomic Clones
Corresponding to Rearranged and Expressed
Copies of Immunoglobulin κ Light Chain Genes

This example describes the cloning of immunoglobulin
κ light chain genes from cultured cells that express an
immunoglobulin of interest. Such cells may contain multiple
alleles of a given immunoglobulin gene. For example, a
hybridoma might contain four copies of the κ light chain gene,
two copies from the fusion partner cell line and two copies
from the original B-cell expressing the immunoglobulin of
interest. Of these four copies, only one encodes the
immunoglobulin of interest, despite the fact that several of
them may be rearranged. The procedure described in this
example allows for the selective cloning of the expressed copy
of the κ light chain.

A. Double Stranded cDNA 

Cells from human hybridoma, or lymphoma, or other
cell line that synthesizes either cell surface or secreted or
both forms of IgM with a κ light chain are used for the
isolation of polyA+ RNA. The RNA is then used for the
synthesis of oligo dT primed cDNA using the enzyme reverse
transcriptase. The single-stranded cDNA is then isolated and
G residues are added to the 3' end using the enzyme
polynucleotide terminal transferase. The G-tailed
single-stranded cDNA is then purified and used as template for
second strand synthesis (catalyzed by the enzyme DNA
polymerase) using the following oligonucleotide as a primer:

5' - GAG GTA CAC TGA CAT ACT GGC ATG CCC
CCC CCC CCC - 3'

The double stranded cDNA is isolated and used for
determining the nucleotide sequence of the 5' end of the mRNAs
encoding the heavy and light chains of the expressed
immunoglobulin molecule. Genomic clones of these expressed
genes are then isolated. The procedure for cloning the
expressed light chain gene is outlined in part B below.

B. Light Chain

The double stranded cDNA described in part A is
denatured and used as a template for a third round of DNA
synthesis using the following oligonucleotide primer:

5' - GTA CGC CAT ATC AGC TGG ATG AAG TCA TCA GAT
GGC GGG AAG ATG AAG ACA GAT GGT GCA - 3'

This primer contains sequences specific for the
constant portion of the κ light chain message (TCA TCA GAT GGC
GGG AAG ATG AAG ACA GAT GGT GCA) as well as unique sequences
that can be used as a primer for the PCR amplification of the
newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG
AAG). The sequence is amplified by PCR using the following
two oligonucleotide primers:

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'
5' - GTA CGC CAT ATC AGC TGG ATG AAG - 3'
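
The primer layout described above (a unique 5' tag followed by a constant-region-specific 3' portion) can be verified directly from the sequences given. The sketch below is illustrative; the variable and function names are hypothetical, and the long primer is assumed to be the simple concatenation of the two parts named in the text. It also checks that the constant-region portion is the reverse complement of part of the Cκ-specific probe used in Example 5, as expected for a primer that anneals to the sense strand.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences from the text (spaces removed).
PCR_TAG = "GTACGCCATATCAGCTGGATGAAG"                  # unique 5' tag
CK_SPECIFIC = "TCATCAGATGGCGGGAAGATGAAGACAGATGGTGCA"  # C-kappa portion
ANCHOR_PCR_PRIMER = "GAGGTACACTGACATACTGGCATG"
SECOND_STRAND_PRIMER = ANCHOR_PCR_PRIMER + "C" * 12   # anneals to the G tail
CK_PROBE = "GAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTG"  # Example 5 probe

# Assumption: the third-round primer is tag + constant-region portion.
THIRD_ROUND_PRIMER = PCR_TAG + CK_SPECIFIC


def split_primer(primer: str, tag: str):
    """Split a tagged primer into (tag, gene-specific part)."""
    if not primer.startswith(tag):
        raise ValueError("primer does not start with the expected tag")
    return tag, primer[len(tag):]


if __name__ == "__main__":
    tag, specific = split_primer(THIRD_ROUND_PRIMER, PCR_TAG)
    print("tag:", tag)
    print("gene-specific part:", specific)
    print("antisense to C-kappa probe:",
          revcomp(specific).startswith(CK_PROBE[10:]))
```

Because both PCR primers correspond to sequences introduced by the synthesis primers rather than to immunoglobulin sequence, only strands carrying both tags are amplified.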






The PCR amplified sequence is then purified by gel 
electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'

The first 42 nucleotides of sequence will then be 
used to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 
42 nucleotide segment of DNA will be referred to below as 
o-kappa. 

A Southern blot of DNA, isolated from the Ig
expressing cell line and digested individually and in pairwise
combinations with several different restriction endonucleases
including SmaI, is then probed with the 32-P labelled unique
oligonucleotide o-kappa. A unique restriction endonuclease
site is identified upstream of the rearranged V segment.

DNA from the Ig expressing cell line is then cut
with SmaI and a second enzyme (or BamHI or KpnI if there is a
SmaI site inside the V segment). Any resulting non-blunted ends
are treated with the enzyme T4 DNA polymerase to give blunt
ended DNA molecules. Restriction site encoding linkers are then
added (BamHI, EcoRI or XhoI depending on what site does not
exist in the fragment) and cut with the corresponding linker
enzyme to give DNA fragments with BamHI, EcoRI or XhoI ends.
The DNA is then size fractionated by agarose gel
electrophoresis, and the fraction including the DNA fragment
covering the expressed V segment is cloned into lambda EMBL3
or Lambda FIX (Stratagene, La Jolla, California). V segment
containing clones are isolated using the unique probe o-kappa.
DNA is isolated from positive clones and subcloned into the
polylinker of pKap1. The resulting clone is called pRKL.

EXAMPLE 7

Isolation of Genomic Clones
Corresponding to Rearranged Expressed Copies
of Immunoglobulin Heavy Chain μ Genes

This example describes the cloning of immunoglobulin
heavy chain μ genes from cultured cells that express an
immunoglobulin of interest. The procedure described in this
example allows for the selective cloning of the expressed copy
of a μ heavy chain gene.

Double-stranded cDNA is prepared and isolated as
described hereinbefore. The double-stranded cDNA is
denatured and used as a template for a third round of DNA
synthesis using the following oligonucleotide primer:

5' - GTA CGC CAT ATC AGC TGG ATG AAG ACA GGA GAC
GAG GGG GAA AAG GGT TGG GGC GGA TGC - 3'

This primer contains sequences specific for the
constant portion of the μ heavy chain message (ACA GGA GAC GAG
GGG GAA AAG GGT TGG GGC GGA TGC) as well as unique sequences
that can be used as a primer for the PCR amplification of the
newly synthesized DNA strand (GTA CGC CAT ATC AGC TGG ATG
AAG). The sequence is amplified by PCR using the following
two oligonucleotide primers:

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3'
5' - GTA CGC CAT ATC AGC TGG ATG AAG - 3'

The PCR amplified sequence is then purified by gel 
electrophoresis and used as template for dideoxy sequencing 
reactions using the following oligonucleotide as a primer: 

5' - GAG GTA CAC TGA CAT ACT GGC ATG - 3' 

The first 42 nucleotides of sequence are then used 
to synthesize a unique probe for isolating the gene from 
which immunoglobulin message was transcribed. This synthetic 
42 nucleotide segment of DNA will be referred to below as 
o-mu. 

A Southern blot of DNA, isolated from the Ig
expressing cell line and digested individually and in pairwise
combinations with several different restriction endonucleases
including MluI (MluI is a rare cutting enzyme that cleaves
between the J segment and mu CH1), is then probed with the
32-P labelled unique oligonucleotide o-mu. A unique
restriction endonuclease site is identified upstream of the
rearranged V segment.

DNA from the Ig expressing cell line is then cut
with MluI and a second enzyme. MluI or SpeI adapter linkers are
then ligated onto the ends and cut to convert the upstream
site to MluI or SpeI. The DNA is then size fractionated by
agarose gel electrophoresis, and the fraction including the
DNA fragment covering the expressed V segment is cloned
directly into the plasmid pGP1. V segment containing clones
are isolated using the unique probe o-mu, and the insert is
subcloned into MluI or MluI/SpeI cut plasmid pCON2. The
resulting plasmid is called pRMGH.

EXAMPLE 8

Construction of Human κ Miniloci Transgenes
Light Chain Minilocus

A human genomic DNA phage library was screened with
kappa light chain specific oligonucleotide probes and clones
spanning the Jκ-C region were isolated. A 5.7 kb ClaI/XhoI
fragment containing Jκ1 together with a 13 kb XhoI fragment
containing Jκ2-5 and Cκ was cloned into pGP1d and used to
create the plasmid pKcor. This plasmid contains Jκ1-5, the
kappa intronic enhancer and Cκ together with 4.5 kb of 5' and
9 kb of 3' flanking sequences. It also has a unique 5' XhoI
site for cloning Vκ segments and a unique 3' SalI site for
inserting additional cis-acting regulatory sequences.

V kappa genes

A human genomic DNA phage library was screened with
Vκ light chain specific oligonucleotide probes and clones
containing human Vκ segments were isolated. Functional V
segments were identified by DNA sequence analysis. These
clones contain TATA boxes, open reading frames encoding leader
and variable peptides (including 2 cysteine residues), splice
sequences, and recombination heptamer-12 bp spacer-nonamer
sequences. Three of the clones were mapped and sequenced.
Two of the clones, 65.5 and 65.8, appear to be functional:
they contain TATA boxes, open reading frames encoding leader
and variable peptides (including 2 cysteine residues), splice
sequences, and recombination heptamer-12 bp spacer-nonamer
sequences. The third clone, 65.4, appears to encode a
pseudogene as it contains a non-canonical recombination
heptamer.

One of the functional clones, Vκ 65.8, which encodes
a VκIII family gene, was used to build a light chain minilocus
construct.

pKC1

The kappa light chain minilocus transgene pKC1 (Fig.
32) was generated by inserting a 7.5 kb XhoI/SalI fragment
containing Vκ 65.8 into the 5' XhoI site of pKcor. The
transgene insert was isolated by digestion with NotI prior to
injection.

The purified insert was microinjected into the
pronuclei of fertilized (C57BL/6 x CBA)F2 mouse embryos and
the surviving embryos were transferred into pseudopregnant
females as described by Hogan et al. (in Methods of
Manipulating the Mouse Embryo, 1986, Cold Spring Harbor
Laboratory, New York). Mice that developed from injected
embryos were analyzed for the presence of transgene sequences
by Southern blot analysis of tail DNA. Transgene copy number
was estimated by band intensity relative to control standards
containing known quantities of cloned DNA. Serum was isolated
from these animals and assayed for the presence of transgene
encoded human Ig kappa protein by ELISA as described by Harlow
and Lane (in Antibodies: A Laboratory Manual, 1988, Cold
Spring Harbor Laboratory, New York). Microtiter plate wells
were coated with mouse monoclonal antibodies specific for
human Ig kappa (clone 6E1, #0173, AMAC, Inc., Westbrook, ME),
human IgM (clone AF6, #0285, AMAC, Inc., Westbrook, ME) and
human IgG1 (clone JL512, #0280, AMAC, Inc., Westbrook, ME).
Serum samples were serially diluted into the wells and the
presence of specific immunoglobulins detected with affinity
isolated alkaline phosphatase conjugated goat anti-human Ig
(polyvalent) that had been pre-adsorbed to minimize
cross-reactivity with mouse immunoglobulins.
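
Estimating transgene copy number from band intensity relative to standards amounts to interpolation on a standard curve. The sketch below shows one simple way to do this by piecewise linear interpolation. It is an assumption for illustration, not the procedure actually used: the text does not state how the comparison was made, and the standard intensities in the example are hypothetical values, not data from this work.

```python
def estimate_copy_number(band_intensity, standards):
    """Interpolate copy number from (copies, intensity) standards.

    Standards are assumed to have strictly increasing intensity with
    increasing copy number; values outside their range are rejected.
    """
    points = sorted(standards, key=lambda p: p[1])
    for (c0, i0), (c1, i1) in zip(points, points[1:]):
        if i0 <= band_intensity <= i1:
            fraction = (band_intensity - i0) / (i1 - i0)
            return c0 + (c1 - c0) * fraction
    raise ValueError("band intensity outside the range of the standards")


# Hypothetical standards: (copies per genome, arbitrary intensity units).
HYPOTHETICAL_STANDARDS = [(1, 100.0), (5, 480.0), (10, 1000.0)]

if __name__ == "__main__":
    for intensity in (100.0, 290.0, 740.0):
        copies = estimate_copy_number(intensity, HYPOTHETICAL_STANDARDS)
        print(f"intensity {intensity}: about {copies:.1f} copies")
```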

Fig. 35 shows the results of an ELISA assay of serum
from 8 mice (I.D. #676, 674, 673, 670, 666, 665, 664, and
496). The first seven of these mice developed from embryos
that were injected with the pKC1 transgene insert and the
eighth mouse is derived from a mouse generated by
microinjection of the pHC1 transgene (described previously).
Two of the seven mice from KC1 injected embryos (I.D.#'s 666
and 664) did not contain the transgene insert as assayed by
DNA Southern blot analysis, and five of the mice (I.D.#'s 676,
674, 673, 670, and 665) contained the transgene. All but one
of the KC1 transgene positive animals express detectable
levels of human Ig kappa protein, and the single
non-expressing animal appears to be a genetic mosaic on the
basis of DNA Southern blot analysis. The pHC1 positive
transgenic mouse expresses human IgM and IgG1 but not Ig
kappa, demonstrating the specificity of the reagents used in
the assay.

pKC2

The kappa light chain minilocus transgene pKC2 was
generated by inserting an 8 kb XhoI/SalI fragment containing
Vκ 65.5 into the 5' XhoI site of pKC1. The resulting
transgene insert, which contains two Vκ segments, was isolated
prior to microinjection by digestion with NotI.



pKVe2

This construct is identical to pKC1 except that it
includes 1.2 kb of additional sequence 5' of Jκ and is missing
4.5 kb of sequence 3' of Vκ 65.8. In addition it contains a
0.9 kb XbaI fragment containing the mouse heavy chain J-μ
intronic enhancer (Banerji et al., Cell 33:729-740 (1983))
together with a 1.4 kb MluI/HindIII fragment containing the
human heavy chain J-μ intronic enhancer (Hayday et al., Nature
307:334-340 (1984)) inserted downstream. This construct tests
the feasibility of initiating early rearrangement of the light
chain minilocus to effect allelic and isotypic exclusion.
Analogous constructs can be generated with different
enhancers, i.e., the mouse or rat 3' kappa or heavy chain
enhancer (Meyer and Neuberger, EMBO J. 8:1959-1964 (1989);
Petterson et al., Nature 344:165-168 (1990), which are
incorporated herein by reference).

Rearranged Light Chain Transgenes

A kappa light chain expression cassette was designed
to reconstruct functionally rearranged light chain genes that
have been amplified by PCR from human B-cell DNA. The scheme
is outlined in Fig. 33. PCR amplified light chain genes are
cloned into the vector pK5nx that includes 3.7 kb of 5'
flanking sequences isolated from the kappa light chain gene
65.5. The VJ segment fused to the 5' transcriptional
sequences is then cloned into the unique XhoI site of the
vector pK31s that includes Jκ2-4, the Jκ intronic enhancer, Cκ,
and 9 kb of downstream sequences. The resulting plasmid
contains a reconstructed functionally rearranged kappa light
chain transgene that can be excised with NotI for
microinjection into embryos. The plasmids also contain unique
SalI sites at the 3' end for the insertion of additional
cis-acting regulatory sequences.

Two synthetic oligonucleotides (o-130, o-131) were
used to amplify rearranged kappa light chain genes from human
spleen genomic DNA. Oligonucleotide o-131 (gga ccc aga
(g,c)gg aac cat gga a(g,a)(g,a,t,c)) is complementary to the
5' region of VκIII family light chain genes and overlaps the
first ATG of the leader sequence. Oligonucleotide o-130 (gtg
caa tca att ctc gag ttt gac tac aga c) is complementary to a
sequence approximately 150 bp 3' of Jκ1 and includes an XhoI
site. These two oligonucleotides amplify a 0.7 kb DNA
fragment from human spleen DNA corresponding to rearranged
VκIII genes joined to Jκ1 segments. The PCR amplified DNA was
digested with NcoI and XhoI, and individual PCR products were
cloned into the plasmid pNN03. The DNA sequence of 5 clones
was determined, and two were identified with functional VJ
joints (open reading frames). Additional functionally
rearranged light chain clones are collected. The functionally
rearranged clones can be individually cloned into the light
chain expression cassette described above (Fig. 33). Transgenic
mice generated with the rearranged light chain constructs can
be bred with heavy chain minilocus transgenics to produce a
strain of mice that express a spectrum of fully human
antibodies in which all of the diversity of the primary
repertoire is contributed by the heavy chain. One source of
light chain diversity can be from somatic mutation. Because
not all light chains will be equivalent with respect to their
ability to combine with a variety of different heavy chains,
different strains of mice, each containing different light
chain constructs, can be generated and tested. The advantage
of this scheme, as opposed to the use of unrearranged light
chain miniloci, is the increased light chain allelic and
isotypic exclusion that comes from having the light chain
ready to pair with a heavy chain as soon as heavy chain VDJ
joining occurs. This combination can result in an increased
frequency of B-cells expressing fully human antibodies, and
thus it can facilitate the isolation of human Ig expressing
hybridomas.
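
Oligonucleotide o-131 above is written with degenerate positions, so it stands for a small pool of primers rather than a single sequence. The following is a minimal sketch, not part of the original procedure, that expands the parenthesized choices into the individual sequences; the helper name `expand_degenerate` and the exact transcription of the primer string are assumptions made for illustration.

```python
from itertools import product

def expand_degenerate(pattern):
    """Expand a primer written with (x,y) choice groups into concrete sequences.

    Hypothetical helper: whitespace is ignored, and each parenthesized,
    comma-separated group is treated as one degenerate position.
    """
    choices = []
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "(":
            j = pattern.index(")", i)
            choices.append([b.strip() for b in pattern[i + 1:j].split(",")])
            i = j + 1
        elif ch.isspace():
            i += 1
        else:
            choices.append([ch])
            i += 1
    return ["".join(p) for p in product(*choices)]

# o-131 as transcribed from the text (assumed reading of the degenerate groups).
O131 = "gga ccc aga (g,c)gg aac cat gga a(g,a)(g,a,t,c)"
variants = expand_degenerate(O131)

# Every variant keeps the NcoI recognition sequence (ccatgg) that spans the ATG.
ncoi_conserved = all("ccatgg" in v for v in variants)
```

Under this reading the pool contains 2 x 2 x 4 = 16 primers of 24 nucleotides, all of which retain the NcoI site used for cloning the amplified product.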

NotI inserts of plasmids pIGM1, pHC1, pIGG1, pKC1,
and pKC2 were isolated away from vector sequences by agarose
gel electrophoresis. The purified inserts were microinjected
into the pronuclei of fertilized (C57BL/6 x CBA) F2 mouse
embryos, and the surviving embryos were transferred into
pseudopregnant females as described by Hogan et al. (Hogan et
al., Methods of Manipulating the Mouse Embryo, Cold Spring
Harbor Laboratory, New York (1986)).



EXAMPLE 9 

Inactivation of the Mouse Kappa Light Chain Gene by Homologous
Recombination

This example describes the inactivation of the mouse
endogenous kappa locus by homologous recombination in
embryonic stem (ES) cells followed by introduction of the
mutated gene into the mouse germ line by injection of targeted
ES cells bearing an inactivated kappa allele into early mouse
embryos (blastocysts).

The strategy is to delete Jκ and Cκ by homologous
recombination with a vector containing DNA sequences
homologous to the mouse kappa locus in which a 4.5 kb segment
of the locus, spanning the Jκ gene and Cκ segments, is deleted
and replaced by the selectable marker neo.

Construction of the kappa targeting vector

The plasmid pGEM7 (KJ1) contains the neomycin
resistance gene (neo), used for drug selection of transfected
ES cells, under the transcriptional control of the mouse
phosphoglycerate kinase (pgk) promoter (XbaI/TaqI fragment;
Adra et al., Gene 60:65-74 (1987)) in the cloning vector
pGEM-7Zf(+). The plasmid also includes a heterologous
polyadenylation site for the neo gene, derived from the 3'
region of the mouse pgk gene (PvuII/HindIII fragment; Boer et
al., Biochemical Genetics 28:299-308 (1990)). This plasmid
was used as the starting point for construction of the kappa
targeting vector. The first step was to insert sequences
homologous to the kappa locus 3' of the neo expression
cassette.

Mouse kappa chain sequences (Fig. 20a) were isolated
from a genomic phage library derived from liver DNA using
oligonucleotide probes specific for the Cκ locus:

5'- GGC TGA TGC TGC ACC AAC TGT ATC CAT CTT CCC ACC ATC GAG
-3'

and for the Jκ5 gene segment:

5'- CTC ACG TTC GGT GCT GGG ACC AAG CTG GAG CTG AAA CGT AAG
-3'



An 8 kb BglII/SacI fragment extending 3' of the
mouse Cκ segment was isolated from a positive phage clone in
two pieces, as a 1.2 kb BglII/SacI fragment and a 6.8 kb SacI
fragment, and subcloned into BglII/SacI digested pGEM7 (KJ1)
to generate the plasmid pNEO-K3' (Fig. 20b).

A 1.2 kb EcoRI/SphI fragment extending 5' of the Jκ
region was also isolated from a positive phage clone. An
SphI/XbaI/BglII/EcoRI adaptor was ligated to the SphI site of
this fragment, and the resulting EcoRI fragment was ligated
into EcoRI digested pNEO-K3', in the same 5' to 3' orientation
as the neo gene and the downstream 3' kappa sequences, to
generate pNEO-K5'3' (Fig. 20c).

The Herpes Simplex Virus (HSV) thymidine kinase (TK)
gene was then included in the construct in order to allow for
enrichment of ES clones bearing homologous recombinants, as
described by Mansour et al., Nature 336:348-352 (1988), which
is incorporated herein by reference. The HSV TK cassette was
obtained from the plasmid pGEM7 (TK), which contains the
structural sequences for the HSV TK gene bracketed by the
mouse pgk promoter and polyadenylation sequences as described
above for pGEM7 (KJ1). The EcoRI site of pGEM7 (TK) was
modified to a BamHI site and the TK cassette was then excised
as a BamHI/HindIII fragment and subcloned into pGP1b to
generate pGP1b-TK. This plasmid was linearized at the XhoI
site and the XhoI fragment from pNEO-K5'3', containing the neo
gene flanked by genomic sequences from 5' of Jκ and 3' of Cκ,
was inserted into pGP1b-TK to generate the targeting vector
J/C K1 (Fig. 20d). The putative structure of the genomic
kappa locus following homologous recombination with J/C K1 is
shown in Fig. 20e.

Generation and analysis of ES cells with targeted inactivation
of a kappa allele

The ES cells used were the AB-1 line grown on
mitotically inactive SNL76/7 cell feeder layers (McMahon and
Bradley, Cell 62:1073-1085 (1990)) essentially as described
(Robertson, E.J. (1987) in Teratocarcinomas and Embryonic Stem
Cells: A Practical Approach. E.J. Robertson, ed. (Oxford: IRL
Press), p. 71-112). Other suitable ES lines include, but are
not limited to, the E14 line (Hooper et al. (1987) Nature
326:292-295), the D3 line (Doetschman et al. (1985) J. Embryol.
Exp. Morph. 87:27-45), and the CCE line (Robertson et al.
(1986) Nature 323:445-448). The success of generating a
mouse line from ES cells bearing a specific targeted mutation
depends on the pluripotence of the ES cells (i.e., their
ability, once injected into a host blastocyst, to participate
in embryogenesis and contribute to the germ cells of the
resulting animal).

The pluripotence of any given ES cell line can vary
with time in culture and the care with which it has been
handled. The only definitive assay for pluripotence is to
determine whether the specific population of ES cells to be
used for targeting can give rise to chimeras capable of
germline transmission of the ES genome. For this reason,
prior to gene targeting, a portion of the parental population
of AB-1 cells is injected into C57BL/6J blastocysts to
ascertain whether the cells are capable of generating chimeric
mice with extensive ES cell contribution and whether the
majority of these chimeras can transmit the ES genome to
progeny.

The kappa chain inactivation vector J/C K1 was
digested with NotI and electroporated into AB-1 cells by the
methods described (Hasty et al., Nature 350:243-246 (1991)).
Electroporated cells were plated onto 100 mm dishes at a
density of 1-2 x 10^6 cells/dish. After 24 hours, G418
(200 μg/ml of active component) and FIAU (0.5 mM) were added to
the medium, and drug-resistant clones were allowed to develop
over 10-11 days. Clones were picked, trypsinized, divided
into two portions, and further expanded. Half of the cells
derived from each clone were then frozen and the other half
analyzed for homologous recombination between vector and
target sequences.

DNA analysis was carried out by Southern blot
hybridization. DNA was isolated from the clones as described
(Laird et al., Nucleic Acids Res. 19:4293 (1991)), digested
with XbaI and probed with the 800 bp EcoRI/XbaI fragment
indicated in Fig. 20e as probe A. This probe detects a 3.7 kb
XbaI fragment in the wild type locus, and a diagnostic 1.8 kb
band in a locus which has homologously recombined with the
targeting vector (see Fig. 20a and e). Of 901 G418 and FIAU
resistant clones screened by Southern blot analysis, 7
displayed the 1.8 kb XbaI band indicative of a homologous
recombination into one of the kappa genes. These 7 clones
were further digested with the enzymes BglII, SacI, and PstI
to verify that the vector integrated homologously into one of
the kappa genes. When probed with the diagnostic 800 bp
EcoRI/XbaI fragment (probe A), BglII, SacI, and PstI digests
of wild type DNA produce fragments of 4.1, 5.4, and 7 kb,
respectively, whereas the presence of a targeted kappa allele
would be indicated by fragments of 2.4, " 5, and 5.7 kb, 
respectively (see Fig. 20a and e). All positive clones
detected by the XbaI digest showed the expected BglII, SacI,
and PstI restriction fragments diagnostic of a homologous
recombination at the kappa light chain. In addition, Southern
blot analysis of an NsiI digest of the targeted clones using a
neo specific probe (probe B, Fig. 20e) generated only the
predicted fragment of 4.2 kb, demonstrating that the clones
each contained only a single copy of the targeting vector.

Generation of mice bearing the inactivated kappa chain

Five of the targeted ES clones described in the
previous section were thawed and injected into C57BL/6J
blastocysts as described (Bradley, A. (1987) in
Teratocarcinomas and Embryonic Stem Cells: A Practical
Approach. E.J. Robertson, ed. (Oxford: IRL Press), p. 113-151)
and transferred into the uteri of pseudopregnant females to
generate chimeric mice resulting from a mixture of cells
derived from the input ES cells and the host blastocyst. The
extent of ES cell contribution to the chimeras can be visually
estimated by the amount of agouti coat coloration, derived
from the ES cell line, on the black C57BL/6J background.
Approximately half of the offspring resulting from blastocyst
injection of the targeted clones were chimeric (i.e., showed
agouti as well as black pigmentation) and of these, the
majority showed extensive (70 percent or greater) ES cell
contribution to coat pigmentation. The AB-1 ES cells are an XY
cell line and a majority of these high percentage chimeras
were male due to sex conversion of female embryos colonized by
male ES cells. Male chimeras derived from 4 of the 5 targeted
clones were bred with C57BL/6J females and the offspring
monitored for the presence of the dominant agouti coat color
indicative of germline transmission of the ES genome.
Chimeras from two of these clones consistently generated
agouti offspring. Since only one copy of the kappa locus was
targeted in the injected ES clones, each agouti pup had a 50
percent chance of inheriting the mutated locus. Screening for
the targeted gene was carried out by Southern blot analysis of
BglII-digested DNA from tail biopsies, using the probe
utilized in identifying targeted ES clones (probe A, Fig.
20e). As expected, approximately 50 percent of the agouti
offspring showed a hybridizing BglII band of 2.4 kb in
addition to the wild-type band of 4.1 kb, demonstrating the
germline transmission of the targeted kappa locus.
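
The band pattern described above amounts to a simple decision rule for each tail-biopsy lane. The sketch below illustrates that rule for the BglII digest probed with probe A, using the 4.1 kb and 2.4 kb sizes given in the text; the function name `call_genotype` and the 0.2 kb size tolerance are assumptions introduced for illustration and are not part of the described procedure.

```python
# Diagnostic BglII band sizes (kb) detected by probe A, as stated in the text.
WILD_TYPE_KB = 4.1
TARGETED_KB = 2.4

def call_genotype(band_sizes_kb, tolerance_kb=0.2):
    """Classify one lane from its observed band sizes (hypothetical helper).

    tolerance_kb is an assumed allowance for gel sizing error.
    """
    def has(expected):
        return any(abs(b - expected) <= tolerance_kb for b in band_sizes_kb)

    wild_type, targeted = has(WILD_TYPE_KB), has(TARGETED_KB)
    if wild_type and targeted:
        return "heterozygous"
    if wild_type:
        return "wild-type"
    if targeted:
        return "homozygous targeted"
    return "uninterpretable"

# An agouti pup showing both bands carries one targeted kappa allele.
pup = call_genotype([4.1, 2.4])
```

A lane with only the 2.4 kb band would correspond to an animal lacking any wild-type kappa allele, which is the pattern expected only after heterozygotes are intercrossed.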
In order to generate mice homozygous for the
mutation, heterozygotes were bred together and the kappa
genotype of the offspring determined as described above. As
expected, three genotypes were derived from the heterozygote
matings: wild-type mice bearing two copies of a normal kappa
locus, heterozygotes carrying one targeted copy of the kappa
gene and one WT kappa gene, and mice homozygous for the kappa
mutation. The deletion of kappa sequences from these latter
mice was verified by hybridization of the Southern blots with
a probe specific for Jκ (probe C, Fig. 20a). Whereas
hybridization of the Jκ probe was observed to DNA samples from
heterozygous and wild-type siblings, no hybridizing signal was
present in the homozygotes, attesting to the generation of a
novel mouse strain in which both copies of the kappa locus
have been inactivated by deletion as a result of targeted
mutation.
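
The three genotypes obtained from the heterozygote matings, and the 50 percent inheritance figure quoted for the chimera outcross, both follow from ordinary Mendelian segregation of a single locus. The following sketch enumerates the gamete combinations; the allele symbols "+" (wild-type) and "-" (targeted) and the function name `cross` are illustrative assumptions.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def cross(parent_a, parent_b):
    """Return expected offspring genotype fractions for a single-locus cross.

    Each parent is a pair of alleles; every gamete combination is
    assumed to be equally likely.
    """
    offspring = Counter(
        "".join(sorted(pair)) for pair in product(parent_a, parent_b)
    )
    total = sum(offspring.values())
    return {genotype: Fraction(n, total) for genotype, n in offspring.items()}

heterozygote = ("+", "-")
wild_type = ("+", "+")

intercross = cross(heterozygote, heterozygote)   # heterozygote x heterozygote
outcross = cross(heterozygote, wild_type)        # targeted germline x wild-type
```

Under these assumptions the intercross gives wild-type, heterozygous, and homozygous mutant offspring in a 1:2:1 ratio, so about one quarter of the pups are expected to lack the probe C signal.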



EXAMPLE 10 

Inactivation of the Mouse Heavy Chain Gene by Homologous
Recombination

This example describes the inactivation of the
endogenous murine immunoglobulin heavy chain locus by
homologous recombination in embryonic stem (ES) cells. The
strategy is to delete the endogenous heavy chain J segments by
homologous recombination with a vector containing heavy chain
sequences from which the JH region has been deleted and
replaced by the gene for the selectable marker neo.



Construction of a heavy chain targeting vector

Mouse heavy chain sequences containing the JH region
(Fig. 21a) were isolated from a genomic phage library derived
from the D3 ES cell line (Gossler et al., Proc. Natl. Acad.
Sci. U.S.A. 83:9065-9069 (1986)) using a JH4 specific
oligonucleotide probe:

5'- ACT ATG CTA TGG ACT ACT GGG GTC AAG GAA CCT CAG TCA CCG
-3'
A 3.5 kb genomic SacI/StuI fragment, spanning the JH
region, was isolated from a positive phage clone and subcloned
into SacI/SmaI digested pUC18. The resulting plasmid was
designated pUC18 JH. The neomycin resistance gene (neo), used
for drug selection of transfected ES cells, was derived from a
repaired version of the plasmid pGEM7 (KJ1). A report in the
literature (Yenofsky et al. (1990) Proc. Natl. Acad. Sci.
(U.S.A.) 87:3435-3439) documents a point mutation in the neo
coding sequences of several commonly used expression vectors,
including the construct pMC1neo (Thomas and Capecchi (1987)
Cell 51:503-512), which served as the source of the neo gene
used in pGEM7 (KJ1). This mutation reduces the activity of
the neo gene product and was repaired by replacing a
restriction fragment encompassing the mutation with the
corresponding sequence from a wild-type neo clone. The
HindIII site in the repaired pGEM7 (KJ1) was converted to a
SalI site by addition of a synthetic adaptor, and the neo
expression cassette excised by digestion with XbaI/SalI. The
ends of the neo fragment were then blunted by treatment with
the Klenow form of DNA pol I, and the neo fragment was
subcloned into the NaeI site of pUC18 JH, generating the
plasmid pUC18 JH-neo (Fig. 21b).

Further construction of the targeting vector was
carried out in a derivative of the plasmid pGP1b. pGP1b was
digested with the restriction enzyme NotI and ligated with the
following oligonucleotide as an adaptor:

5'- GGC CGC TCG ACG ATA GCC TCG AGG CTA TAA ATC TAG AAG AAT
TCC AGC AAA GCT TTG GC -3'

The resulting plasmid, called pGMT, was used to
build the mouse immunoglobulin heavy chain targeting
construct.

The Herpes Simplex Virus (HSV) thymidine kinase (TK)
gene was included in the construct in order to allow for
enrichment of ES clones bearing homologous recombinants, as
described by Mansour et al. (Nature 336:348-352 (1988)). The
HSV TK gene was obtained from the plasmid pGEM7 (TK) by






digestion with EcoRI and HindIII. The TK DNA fragment was
subcloned between the EcoRI and HindIII sites of pGMT,
creating the plasmid pGMT-TK (Fig. 21c).

To provide an extensive region of homology to the
target sequence, a 5.9 kb genomic XbaI/XhoI fragment, situated
5' of the JH region, was derived from a positive genomic phage
clone by limit digestion of the DNA with XhoI, and partial
digestion with XbaI. As noted in Fig. 21a, this XbaI site is
not present in genomic DNA, but is rather derived from phage
sequences immediately flanking the cloned genomic heavy chain
insert in the positive phage clone. The fragment was
subcloned into XbaI/XhoI digested pGMT-TK, to generate the
plasmid pGMT-TK-JH5' (Fig. 21d).

The final step in the construction involved the
excision from pUC18 JH-neo of the 2.8 kb EcoRI fragment which
contained the neo gene and flanking genomic sequences 3' of
JH. This fragment was blunted by Klenow polymerase and
subcloned into the similarly blunted XhoI site of
pGMT-TK-JH5'. The resulting construct, JHKO1 (Fig. 21e),
contains 6.9 kb of genomic sequences flanking the JH locus,
with a 2.3 kb deletion spanning the JH region into which has
been inserted the neo gene. Fig. 21f shows the structure of
an endogenous heavy chain gene after homologous recombination
with the targeting construct.



EXAMPLE 11

Generation and analysis of targeted ES cells

AB-1 ES cells (McMahon and Bradley, Cell
62:1073-1085 (1990)) were grown on mitotically inactive
SNL76/7 cell feeder layers essentially as described
(Robertson, E.J. (1987) in Teratocarcinomas and Embryonic Stem
Cells: A Practical Approach. E.J. Robertson, ed. (Oxford: IRL
Press), pp. 71-112). As described in the previous example,
prior to electroporation of ES cells with the targeting
construct JHKO1, the pluripotency of the ES cells was
determined by generation of AB-1 derived chimeras which were
shown capable of germline transmission of the ES genome.
The heavy chain inactivation vector JHKO1 was
digested with NotI and electroporated into AB-1 cells by the
methods described (Hasty et al., Nature 350:243-246 (1991)).
Electroporated cells were plated into 100 mm dishes at a
density of 1-2 x 10^6 cells/dish. After 24 hours, G418
(200 μg/ml of active component) and FIAU (0.5 mM) were added to
the medium, and drug-resistant clones were allowed to develop
over 8-10 days. Clones were picked, trypsinized, divided into
two portions, and further expanded. Half of the cells derived
from each clone were then frozen and the other half analyzed
for homologous recombination between vector and target
sequences.

DNA analysis was carried out by Southern blot
hybridization. DNA was isolated from the clones as described
(Laird et al. (1991) Nucleic Acids Res. 19:4293), digested
with StuI and probed with the 500 bp EcoRI/StuI fragment
designated as probe A in Fig. 21f. This probe detects a StuI
fragment of 4.7 kb in the wild-type locus, whereas a 3 kb band
is diagnostic of homologous recombination of endogenous
sequences with the targeting vector (see Fig. 21a and f). Of
525 G418 and FIAU doubly-resistant clones screened by Southern
blot hybridization, 12 were found to contain the 3 kb fragment
diagnostic of recombination with the targeting vector. That
these clones represent the expected targeted events at the JH
locus (as shown in Fig. 21f) was confirmed by further
digestion with HindIII, SpeI and HpaI. Hybridization of probe
A (see Fig. 21f) to Southern blots of HindIII, SpeI, and HpaI
digested DNA produces bands of 2.3 kb, >10 kb, and >10 kb,
respectively, for the wild-type locus (see Fig. 21a), whereas
bands of 5.3 kb, 3.8 kb, and 1.9 kb, respectively, are
expected for the targeted heavy chain locus (see Fig. 21f).
All 12 positive clones detected by the StuI digest showed the
predicted HindIII, SpeI, and HpaI bands diagnostic of a
targeted JH gene. In addition, Southern blot analysis of a
StuI digest of all 12 clones using a neo-specific probe (probe
B, Fig. 21f) generated only the predicted fragment of 3 kb,
demonstrating that the clones each contained only a single
copy of the targeting vector.
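
The screening results reported for the two targeting experiments (7 targeted clones among 901 doubly resistant clones for the kappa vector in Example 9, and 12 among 525 for the heavy chain vector here) can be expressed as targeting frequencies. The short calculation below is an illustrative sketch only; the function name is hypothetical and no comparison beyond the arithmetic is implied.

```python
def targeting_frequency(targeted_clones, screened_clones):
    """Fraction of doubly resistant clones that carry the targeted allele."""
    return targeted_clones / screened_clones

# Counts as reported in the text for each targeting vector.
kappa = targeting_frequency(7, 901)    # kappa light chain vector
heavy = targeting_frequency(12, 525)   # heavy chain JH vector

summary = {
    "kappa": f"{kappa:.2%} (about 1 in {round(1 / kappa)})",
    "heavy": f"{heavy:.2%} (about 1 in {round(1 / heavy)})",
}
```

With these counts, roughly 1 in 129 selected clones was targeted at the kappa locus and roughly 1 in 44 at the heavy chain locus.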
Generation of mice carrying the JH deletion

Three of the targeted ES clones described in the
previous section were thawed and injected into C57BL/6J
blastocysts as described (Bradley, A. (1987) in
Teratocarcinomas and Embryonic Stem Cells: A Practical
Approach. E.J. Robertson, ed. (Oxford: IRL Press), p. 113-151)
and transferred into the uteri of pseudopregnant females. The
extent of ES cell contribution to the chimera was visually
estimated from the amount of agouti coat coloration, derived
from the ES cell line, on the black C57BL/6J background. Half
of the offspring resulting from blastocyst injection of two of
the targeted clones were chimeric (i.e., showed agouti as well
as black pigmentation); the third targeted clone did not
generate any chimeric animals. The majority of the chimeras
showed significant (approximately 50 percent or greater) ES
cell contribution to coat pigmentation. Since the AB-1 ES
cells are an XY cell line, most of the chimeras were male, due
to sex conversion of female embryos colonized by male ES
cells. Male chimeras were bred with C57BL/6J females and the
offspring monitored for the presence of the dominant agouti
coat color indicative of germline transmission of the ES
genome. Chimeras from both of the clones consistently
generated agouti offspring. Since only one copy of the heavy
chain locus was targeted in the injected ES clones, each
agouti pup had a 50 percent chance of inheriting the mutated
locus. Screening for the targeted gene was carried out by
Southern blot analysis of StuI-digested DNA from tail
biopsies, using the probe utilized in identifying targeted ES
clones (probe A, Fig. 21f). As expected, approximately 50
percent of the agouti offspring showed a hybridizing StuI band
of approximately 3 kb in addition to the wild-type band of 4.7
kb, demonstrating germline transmission of the targeted JH
gene segment.

In order to generate mice homozygous for the
mutation, heterozygotes were bred together and the heavy chain
genotype of the offspring determined as described above. As
expected, three genotypes were derived from the heterozygote
matings: wild-type mice bearing two copies of the normal JH
locus, heterozygotes carrying one targeted copy of the gene and
one normal copy, and mice homozygous for the JH mutation. The
absence of JH sequences from these latter mice was verified by
hybridization of the Southern blots of StuI-digested DNA with
a probe specific for JH (probe C, Fig. 21a). Whereas
hybridization of the JH probe to a 4.7 kb fragment in DNA
samples from heterozygous and wild-type siblings was observed,
no signal was present in samples from the JH-mutant
homozygotes, attesting to the generation of a novel mouse
strain in which both copies of the heavy chain gene have been
mutated by deletion of the JH sequences.

EXAMPLE 12
Heavy Chain Minilocus Transgene
A. Construction of plasmid vectors for cloning large DNA
sequences
1. pGP1a

The plasmid pBR322 was digested with EcoRI and StyI
and ligated with the following oligonucleotides:

oligo-42 5'- caa gag ccc gcc taa tga gcg ggc ttt ttt ttg cat
act gcg gcc gct -3'

oligo-43 5'- aat tag cgg ccg cag tat gca aaa aaa agc ccg ctc
att agg cgg gct -3'

The resulting plasmid, pGP1a, is designed for
cloning very large DNA constructs that can be excised by the
rare cutting restriction enzyme NotI. It contains a NotI
restriction site downstream (relative to the ampicillin
resistance gene, AmpR) of a strong transcription termination
signal derived from the trpA gene (Christie et al., Proc.
Natl. Acad. Sci. USA 78:4180 (1981)). This termination signal
reduces the potential toxicity of coding sequences inserted
into the NotI site by eliminating readthrough transcription
from the AmpR gene. In addition, this plasmid is low copy
relative to the pUC plasmids because it retains the pBR322
copy number control region. The low copy number further
reduces the potential toxicity of insert sequences and reduces
the selection against large inserts due to DNA replication.
The vectors pGP1b, pGP1c, pGP1d, and pGP1f are derived from
pGP1a and contain different polylinker cloning sites. The
polylinker sequences are given below.

pGP1a

NotI
GCGGCCGC

pGP1b

NotI XhoI ClaI BamHI HindIII NotI
GCggccgcctcgagatcactatcgattaattaaggatccagcagtaagcttgcGGCCGC

pGP1c

NotI SmaI XhoI SalI HindIII BamHI SacII NotI
GCggccgcatcccgggtctcgaggtcgacaagctttcgaggatccgcGGCCGC

pGP1d

NotI SalI HindIII ClaI BamHI XhoI NotI
GCggccgctgtcgacaagcttatcgatggatcctcgagtgcGGCCGC

pGP1f

NotI SalI HindIII EcoRI ClaI KpnI BamHI XhoI NotI
GCggccgctgtcgacaagcttcgaattcagatcgatgtggtacctggatcctcgagtgcGGCCGC

Each of these plasmids can be used for the construction of
large transgene inserts that are excisable with NotI so that
the transgene DNA can be purified away from vector sequences
prior to microinjection.
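
The site order printed above each polylinker can be checked directly against the sequence. The sketch below scans the pGP1d and pGP1f polylinkers for the standard recognition sequences of the enzymes named in the text and lists the sites in 5' to 3' order; the function name `map_sites` is a hypothetical helper, and only exact, non-degenerate recognition sequences are considered.

```python
# Recognition sequences of the enzymes named for these polylinkers.
SITES = {
    "NotI": "GCGGCCGC",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
    "EcoRI": "GAATTC",
    "ClaI": "ATCGAT",
    "KpnI": "GGTACC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
}

def map_sites(seq):
    """Return enzyme names in the order their sites occur along seq."""
    seq = seq.upper()
    hits = []
    for name, site in SITES.items():
        start = seq.find(site)
        while start != -1:
            hits.append((start, name))
            start = seq.find(site, start + 1)
    return [name for _, name in sorted(hits)]

# Polylinker sequences as printed in the text.
PGP1D = "GCggccgctgtcgacaagcttatcgatggatcctcgagtgcGGCCGC"
PGP1F = "GCggccgctgtcgacaagcttcgaattcagatcgatgtggtacctggatcctcgagtgcGGCCGC"

pgp1d_order = map_sites(PGP1D)
pgp1f_order = map_sites(PGP1F)
```

For both sequences the computed order agrees with the printed site labels, with NotI sites at both ends so that any insert cloned between them can be released as a single NotI fragment.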

2. pGP1b

pGP1a was digested with NotI and ligated with the
following oligonucleotides:

oligo-47 5'- ggc cgc aag ctt act gct gga tcc tta att aat cga
tag tga tct cga ggc -3'

oligo-48 5'- ggc cgc ctc gag atc act atc gat taa tta agg atc
cag cag taa gct tgc -3'

The resulting plasmid, pGP1b, contains a short
polylinker region flanked by NotI sites. This facilitates the
construction of large inserts that can be excised by NotI
digestion.
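
For the two oligonucleotides to form an adaptor that ligates into NotI-cut pGP1a, they should pair with each other everywhere except for a 4-nucleotide 5' overhang on each strand matching the NotI cohesive end. The following sketch checks that property; the sequences are given as read here from the printed text (an assumption where the print is unclear), and the helper names are hypothetical.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")

def clean(oligo):
    """Remove the triplet spacing used in the printed sequence."""
    return "".join(oligo.split()).lower()

def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]

def duplex_with_overhangs(top, bottom, overhang=4):
    """True if the strands pair fully apart from a 5' overhang on each."""
    return top[overhang:] == reverse_complement(bottom[overhang:])

OLIGO_47 = clean("ggc cgc aag ctt act gct gga tcc tta att aat cga tag tga tct cga ggc")
OLIGO_48 = clean("ggc cgc ctc gag atc act atc gat taa tta agg atc cag cag taa gct tgc")

anneals = duplex_with_overhangs(OLIGO_47, OLIGO_48)
overhangs = (OLIGO_47[:4], OLIGO_48[:4])
```

Both strands begin with ggcc, the 5' overhang left by NotI, and the paired region of oligo-48 reads through the XhoI, ClaI, BamHI, and HindIII sites in the order given for the pGP1b polylinker.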

3. pGPe

The following oligonucleotides:

oligo-44 5'- ctc cag gat cca gat atc agt acc tga aac agg gct
tgc -3'

oligo-45 5'- ctc gag cat gca cag gac ctg gag cac aca cag cct
tcc -3'

were used to amplify the immunoglobulin heavy chain 3'
enhancer (S. Pettersson et al., Nature 344:165-168 (1990))
from rat liver DNA by the polymerase chain reaction technique.

The amplified product was digested with BamHI and
SphI and cloned into BamHI/SphI digested pNN03 (pNN03 is a pUC
derived plasmid that contains a polylinker with the following
restriction sites, listed in order: NotI, BamHI, NcoI, ClaI,
EcoRV, XbaI, SacI, XhoI, SphI, PstI, BglII, EcoRI, SmaI, KpnI,
HindIII, and NotI). The resulting plasmid, pRE3, was digested
with BamHI and HindIII, and the insert containing the rat Ig
heavy chain 3' enhancer cloned into BamHI/HindIII digested
pGP1b. The resulting plasmid, pGPe (Fig. 22 and Table 1),
contains several unique restriction sites into which sequences
can be cloned and subsequently excised together with the 3'
enhancer by NotI digestion.
TABLE 1

Sequence of vector pGPe (sequence listing not legible in the source).
B. Construction of IgM expressing minilocus transgene, pIGM1
1. Isolation of J-μ constant region clones and construction
of pJM1

A human placental genomic DNA library cloned into
the phage vector λEMBL3/SP6/T7 (Clontech Laboratories, Inc.,
Palo Alto, CA) was screened with the human heavy chain J
region specific oligonucleotide:

oligo-1 5'- gga ctg tgt ccc tgt gtg atg ctt ttg atg tct ggg
gcc aag -3'

and the phage clone λ1.3 isolated. A 6 kb HindIII/KpnI
fragment from this clone, containing all six J segments as
well as D segment DHQ52 and the heavy chain J-μ intronic
enhancer, was isolated. The same library was screened with
the human μ specific oligonucleotide:

oligo-2 5'- cac caa gtt gac ctg cct ggt cac aga cct gac cac
cta tga -3'

and the phage clone λ2.1 isolated. A 10.5 kb HindIII/XhoI
fragment, containing the μ switch region and all of the μ
constant region exons, was isolated from this clone. These
two fragments were ligated together with KpnI/XhoI digested
pNN03 to obtain the plasmid pJM1.

2. pJM2

A 4 kb XhoI fragment was isolated from phage clone
λ2.1 that contains sequences immediately downstream of the
sequences in pJM1, including the so-called Σμ element involved
in δ-associated deletion of the μ gene in certain IgD expressing
B-cells (Yasui et al., Eur. J. Immunol. 19:1399 (1989), which
is incorporated herein by reference). This fragment was
treated with the Klenow fragment of DNA polymerase I and
ligated to XhoI cut, Klenow treated, pJM1. The resulting
plasmid, pJM2 (Fig. 23), had lost the internal XhoI site but
retained the 3' XhoI site due to incomplete reaction by the
Klenow enzyme. pJM2 contains the entire human J region, the
heavy chain J-μ intronic enhancer, the μ switch region and all
of the μ constant region exons, as well as the two 0.4 kb
direct repeats, σμ and Σμ, involved in δ-associated deletion
of the μ gene.

3. Isolation of D region clones and construction of pDH1

The following human D region specific
oligonucleotide:

oligo-4 5'- tgg tat tac tat ggt tcg ggg agt tat tat aac cac
agt gtc -3'

was used to screen the human placenta genomic library for D
region clones. Phage clones λ4.1 and λ4.3 were isolated. A
5.5 kb XhoI fragment, that includes the D elements DK1, DN1,
and DM2 (Ichihara et al., EMBO J. 7:4141 (1988)), was isolated
from phage clone λ4.1. An adjacent upstream 5.2 kb XhoI
fragment, that includes the D elements DLR1, DXP1, DXP'1, and
DA1, was isolated from phage clone λ4.3. Each of these D
region XhoI fragments were cloned into the SalI site of the
plasmid vector pSP72 (Promega, Madison, WI) so as to destroy
the XhoI site linking the two sequences. The upstream
fragment was then excised with XhoI and SmaI, and the
downstream fragment with EcoRV and XhoI. The resulting
isolated fragments were ligated together with SalI digested
pSP72 to give the plasmid pDH1. pDH1 contains a 10.6 kb
insert that includes at least 7 D segments and can be excised
with XhoI (5') and EcoRV (3').



4. pCOR1

The plasmid pJM2 was digested with Asp718 (an
isoschizomer of KpnI) and the overhang filled in with the
Klenow fragment of DNA polymerase I. The resulting DNA was
then digested with ClaI and the insert isolated. This insert
was ligated to the XhoI/EcoRV insert of pDH1 and XhoI/ClaI
digested pGPe to generate pCOR1 (Fig. 24).



5. pVH251

A 10.3 kb genomic HindIII fragment containing the
two human heavy chain variable region segments VH251 and VH105
(Humphries et al., Nature 331:446 (1988), which is
incorporated herein by reference) was subcloned into pSP72 to
give the plasmid pVH251.

6. pIGM1

The plasmid pCOR1 was partially digested with XhoI
and the isolated XhoI/SalI insert of pVH251 cloned into the
upstream XhoI site to generate the plasmid pIGM1 (Fig. 25).
pIGM1 contains 2 functional human variable region segments, at
least 8 human D segments, all 6 human JH segments, the human
J-μ enhancer, the human σμ element, the human μ switch region,
all of the human μ coding exons, and the human Σμ element,
together with the rat heavy chain 3' enhancer, such that all
of these sequence elements can be isolated on a single
fragment, away from vector sequences, by digestion with NotI
and microinjected into mouse embryo pronuclei to generate
transgenic animals.

C. Construction of IgM and IgG expressing minilocus
transgene, pHC1

1. Isolation of γ constant region clones

The following oligonucleotide, specific for human Ig
γ constant region genes:

oligo-29 5'- cag cag gtg cac acc caa tgc cca tga gcc cag aca
ctg gac -3'

was used to screen the human genomic library. Phage clones
λ29.4 and λ29.5 were isolated. A 4 kb HindIII fragment of
phage clone λ29.4, containing a γ switch region, was used to
probe a human placenta genomic DNA library cloned into the
phage vector lambda FIX II (Stratagene, La Jolla, CA). Phage
clone λSγ1.13 was isolated. To determine the subclass of the
different γ clones, dideoxy sequencing reactions were carried
out using subclones of each of the three phage clones as
templates and the following oligonucleotide as a primer:

oligo-67 5'- tga gcc cag aca ctg gac -3'

Phage clones λ29.5 and λSγ1.13 were both determined
to be of the γ1 subclass.



2. pγe1

A 7.8 kb HindIII fragment of phage clone λ29.5,
containing the γ1 coding region, was cloned into pUC18. The
resulting plasmid, pLT1, was digested with XhoI, Klenow
treated, and religated to destroy the internal XhoI site. The
resulting clone, pLT1xk, was digested with HindIII and the
insert isolated and cloned into pSP72 to generate the plasmid
clone pLT1xks. Digestion of pLT1xks at a polylinker XhoI site
and a human sequence derived BamHI site generates a 7.6 kb
fragment containing the γ1 constant region coding exons. This
7.6 kb XhoI/BamHI fragment was cloned together with an
adjacent downstream 4.5 kb BamHI fragment from phage clone
λ29.5 into XhoI/BamHI digested pGPe to generate the plasmid
clone pγe1. pγe1 contains all of the γ1 constant region
coding exons, together with 5 kb of downstream sequences,
linked to the rat heavy chain 3' enhancer.



3. pγe2

A 5.3 kb HindIII fragment containing the γ1 switch
region and the first exon of the pre-switch sterile transcript
(P. Sideras et al. (1989) International Immunol. 1, 631) was
isolated from phage clone λSγ1.13 and cloned into pSP72 with
the polylinker XhoI site adjacent to the 5' end of the insert,
to generate the plasmid clone pSγ1s. The XhoI/SalI insert of
pSγ1s was cloned into XhoI digested pγe1 to generate the
plasmid clone pγe2 (Fig. 26). pγe2 contains all of the γ1
constant region coding exons, and the upstream switch region
and sterile transcript exons, together with 5 kb of downstream
sequences, linked to the rat heavy chain 3' enhancer. This
clone contains a unique XhoI site at the 5' end of the insert.
The entire insert, together with the XhoI site and the 3' rat
enhancer, can be excised from vector sequences by digestion
with NotI.

4. pHC1

The plasmid pIGM1 was digested with XhoI and the 43
kb insert isolated and cloned into XhoI digested pγe2 to
generate the plasmid pHC1 (Fig. 25). pHC1 contains 2
functional human variable region segments, at least 8 human D
segments, all 6 human JH segments, the human J-μ enhancer, the
human σμ element, the human μ switch region, all of the human
μ coding exons, the human Σμ element, and the human γ1
constant region, including the associated switch region and
sterile transcript associated exons, together with the rat
heavy chain 3' enhancer, such that all of these sequence
elements can be isolated on a single fragment, away from
vector sequences, by digestion with NotI and microinjected
into mouse embryo pronuclei to generate transgenic animals.

D. Construction of IgM and IgG expressing minilocus
transgene, pHC2

1. Isolation of human heavy chain V region gene VH49.8

The human placental genomic DNA library (lambda FIX™
II, Stratagene, La Jolla, CA) was screened with the following
human VH1 family specific oligonucleotide:

oligo-49 5'- gtt aaa gag gat ttt att cac ccc tgt gtc ctc tcc
aca ggt gtc -3'

Phage clone λ49.8 was isolated and a 6.1 kb XbaI
fragment containing the variable segment VH49.8 subcloned into
pNN03 (such that the polylinker ClaI site is downstream of
VH49.8 and the polylinker XhoI site is upstream) to generate
the plasmid pVH49.8. An 800 bp region of this insert was
sequenced, and VH49.8 found to have an open reading frame and
intact splicing and recombination signals, thus indicating
that the gene is functional (Table 2).
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TABLE 2 



assaraGS gcttsszctc tcsgcstcxx: .-xsctt^ ^ 
^ , — ■ j.j.^c .-jtct^stc .-jyrrrrsG ::o 

.•M>.A.U;U 

CTGIvSAXEA CCCT3CCTCA TCSATXZGCA AATaATCTS. 150 

-saaamm g^sttgg tstctsga gsicscata acmcggc 200 

rcR s cac: atosscigsa 250 

MstAspTrpT 

T CE==-:.-ydS GIGSCSKSG CSOGgtaa ggggccrccc 300 
nrTrnAraeh eLeuPneVal ValAlaAlaA lalhr 

sncccssagg: raaggaagg gatcccggcr ragcrsaaga ggamzarr 350 



^ , . ■cccr=z=cac agGIGICCSG TCCCAGGICE 



400 



GLyValGln SerGlnVaUS loIjguValGl 
CTdGGSGST GBGGISaGR aOOCSGGSTC' CTOGSIGAaG G1U1LLIGCA 450 
nSerGLyAla GluValLysL ysProGLySe rSerValLys ValSex& sL 

MGcrrcTQS MoaDcrrc araociKEG ottcrgcig GGTGaacaG soo 

vsAlaSerGL yGLvThrPhe SerSerTyrA lalleSerTr pValArqGln 

^uumv3a c abgssctga uiu^ i u^ aasarcKra: eraiu^'iGS sso 

ALaPrcGLyG laGLyLeuGL ultgMetGIy ArgllelleP roneLe^. 

tnaaonaac tbcqcrck». agitccrggs caaGieRCG anacui^ eoo 

ylleAlaAsn TyrftlaGlnL ysPheGHd yArgVallhr IleThrAlaA 

acaaMcrac gmsscagoc TacaroGRGC TGRGCAGocr ^^^J? 3 650 

snLvsSerlti zSerThrftla Tyrt^GluLeuSer^rl^ uArgSerOAi 

Socggods TsranaciG TQcsoGfiEiaSEGSA aaotackk: too 

amffhr ftlaV alTvrTvrCy sAlaftrg . 

SasGnEasScfe gsgososg gmcigigc cGGocroaGG tso 
aghhogs GmariBGG ttevagszig tttrcaaaat gqsteaikea aoo 

812 

TOKSGAAftA AA 



Sequence of human VH1 family gene VH49.8

2. pV2

A 4 kb XbaI genomic fragment containing the human
VH IV family gene VH4-21 (Sanz et al., EMBO J., 8:3741 (1989)),
subcloned into the plasmid pUC12, was excised with SmaI and
HindIII, and treated with the Klenow fragment of polymerase I.
The blunt ended fragment was then cloned into ClaI digested,
Klenow treated, pVH49.8. The resulting plasmid, pV2, contains
the human heavy chain gene VH49.8 linked upstream of VH4-21 in
the same orientation, with a unique SalI site at the 3' end of
the insert and a unique XhoI site at the 5' end.



3. pSγ1-5'

A 0.7 kb XbaI/HindIII fragment (representing
sequences immediately upstream of, and adjacent to, the 5.3 kb
γ1 switch region containing fragment in the plasmid pγe2)
together with the neighboring upstream 3.1 kb XbaI fragment
were isolated from the phage clone λSγ1.13 and cloned into
HindIII/XbaI digested pUC18 vector. The resulting plasmid,
pSγ1-5', contains a 3.8 kb insert representing sequences
upstream of the initiation site of the sterile transcript
found in B-cells prior to switching to the γ1 isotype (P.
Sideras et al., International Immunol. 1:631 (1989)). Because
the transcript is implicated in the initiation of isotype
switching, and upstream cis-acting sequences are often
important for transcription regulation, these sequences are
included in transgene constructs to promote correct expression
of the sterile transcript and the associated switch
recombination.



4. pVGE1

The pSγ1-5' insert was excised with SmaI and
HindIII, treated with Klenow enzyme, and ligated with the
following oligonucleotide linker:

5'- ccg gtc gac cgg -3'

The ligation product was digested with SalI and ligated to
SalI digested pV2. The resulting plasmid, pVP, contains 3.8
kb of γ1 switch 5' flanking sequences linked downstream of the
two human variable gene segments VH49.8 and VH4-21 (see Table
2). The pVP insert is isolated by partial digestion with SalI
and complete digestion with XhoI, followed by purification of
the 15 kb fragment on an agarose gel. The insert is then
cloned into the XhoI site of pγe2 to generate the plasmid
clone pVGE1 (Fig. 27). pVGE1 contains two human heavy chain
variable gene segments upstream of the human γ1 constant gene
and associated switch region. A unique SalI site between the
variable and constant regions can be used to clone in D, J,
and μ gene segments. The rat heavy chain 3' enhancer is
linked to the 3' end of the γ1 gene and the entire insert is
flanked by NotI sites.
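The linker used in this step is short enough to inspect by hand, but a small check makes its two relevant properties explicit. This is a sketch only, assuming the linker reads ccg gtc gac cgg (the reading that contains the SalI recognition sequence gtcgac); the helper names are illustrative. It confirms that the linker is self-complementary, so that a single oligonucleotide can form the duplex, and that it carries one SalI site.

```python
# Watson-Crick complement table for lowercase DNA.
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


# Assumed reading of the 12-mer linker given in the text.
LINKER = "ccg gtc gac cgg".replace(" ", "")
SALI_SITE = "gtcgac"

if __name__ == "__main__":
    # A self-complementary oligo equals its own reverse complement.
    print("self-complementary:", reverse_complement(LINKER) == LINKER)
    print("SalI site offset:", LINKER.find(SALI_SITE))
    print("SalI site count:", LINKER.count(SALI_SITE))
```

Because the linker pairs with itself, blunt-end ligation onto the Klenow-treated insert places a SalI site at each end, which is what allows the subsequent SalI digestion and ligation into SalI digested pV2.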

5. pHC2

The plasmid clone pVGE1 is digested with SalI and
the XhoI insert of pIGM1 is cloned into it. The resulting
clone, pHC2 (Fig. 25), contains 4 functional human variable
region segments, at least 8 human D segments, all 6 human JH
segments, the human J-μ enhancer, the human σμ element, the
human μ switch region, all of the human μ coding exons, the
human Σμ element, and the human γ1 constant region, including
the associated switch region and sterile transcript associated
exons, together with 4 kb flanking sequences upstream of the
sterile transcript initiation site. These human sequences are
linked to the rat heavy chain 3' enhancer, such that all of
the sequence elements can be isolated on a single fragment,
away from vector sequences, by digestion with NotI and
microinjected into mouse embryo pronuclei to generate
transgenic animals. A unique XhoI site at the 5' end of the
insert can be used to clone in additional human variable gene
segments to further expand the recombinational diversity of
this heavy chain minilocus.

E. Transgenic mice

The NotI inserts of plasmids pIGM1 and pHC1 were
isolated from vector sequences by agarose gel electrophoresis.
The purified inserts were microinjected into the pronuclei of
fertilized (C57BL/6 x CBA)F2 mouse embryos, and the
surviving embryos were transferred into pseudopregnant females
as described by Hogan et al. (B. Hogan, F. Costantini, and
E. Lacy, Methods of Manipulating the Mouse Embryo, 1986, Cold
Spring Harbor Laboratory, New York). Mice that developed from
injected embryos were analyzed for the presence of transgene
sequences by Southern blot analysis of tail DNA. Transgene
copy number was estimated by band intensity relative to
control standards containing known quantities of cloned DNA.
At 3 to 8 weeks of age, serum was isolated from these animals
and assayed for the presence of transgene encoded human IgM
and IgG1 by ELISA as described by Harlow and Lane (E. Harlow
and D. Lane, Antibodies: A Laboratory Manual, 1988, Cold
Spring Harbor Laboratory, New York). Microtiter plate wells
were coated with mouse monoclonal antibodies specific for
human IgM (clone AF6, #0285, AMAC, Inc. Westbrook, ME) and
human IgG1 (clone JL512, #0280, AMAC, Inc. Westbrook, ME).
Serum samples were serially diluted into the wells and the
presence of specific immunoglobulins detected with affinity
isolated alkaline phosphatase conjugated goat anti-human Ig
(polyvalent) that had been pre-adsorbed to minimize
cross-reactivity with mouse immunoglobulins. Table 3 and
Fig. 28 show the results of an ELISA assay for the presence of
human IgM and IgG1 in the serum of two animals that developed
from embryos injected with the transgene insert of plasmid
pHC1. All of the control non-transgenic mice tested negative
for expression of human IgM and IgG1 by this assay. Mice from
two lines containing the pIGM1 NotI insert (lines #6 and 15)
express human IgM but not human IgG1. We tested mice from 6
lines that contain the pHC1 insert and found that 4 of the
lines (lines #26, 38, 57 and 122) express both human IgM and
human IgG1, while mice from two of the lines (lines #19 and
21) do not express detectable levels of human immunoglobulins.
The pHC1 transgenic mice that did not express human
immunoglobulins were so-called G0 mice that developed directly
from microinjected embryos and may have been mosaic for the
presence of the transgene. Southern blot analysis indicates
that many of these mice contain one or fewer copies of the
transgene per cell. The detection of human IgM in the serum
of pIGM1 transgenics, and human IgM and IgG1 in pHC1
transgenics, provides evidence that the transgene sequences
function correctly in directing VDJ joining, transcription,
and isotype switching. One of the animals (#18) was negative
for the transgene by Southern blot analysis, and showed no
detectable levels of human IgM or IgG1. The second animal
(#38) contained approximately 5 copies of the transgene, as
assayed by Southern blotting, and showed detectable levels of
both human IgM and IgG1. The results of ELISA assays for 11
animals that developed from transgene injected embryos are
summarized in the table below (Table 3).

TABLE 3

Detection of human IgM and IgG1 in the serum of transgenic
animals by ELISA assay

                          approximate
            injected      transgene
animal #    transgene     copies per cell    human IgM    human IgG1

   6        pIGM1          1
   7        pIGM1          0
   9        pIGM1          0
  10        pIGM1          0
  12        pIGM1          0
  15        pIGM1         10
  18        pHC1           0
  19        pHC1           1
  21        pHC1          <1
  26        pHC1           2                 ++           +
  38        pHC1           5                 ++           +

Table 3 shows a correlation between the presence of
integrated transgene DNA and the presence of transgene encoded
immunoglobulins in the serum. Two of the animals that were
found to contain the pHC1 transgene did not express detectable
levels of human immunoglobulins. These were both low copy
animals and may not have contained complete copies of the
transgenes, or the animals may have been genetic mosaics
(indicated by the <1 copy per cell estimated for animal #21),
and the transgene containing cells may not have populated the
hematopoietic lineage. Alternatively, the transgenes may have
integrated into genomic locations that are not conducive to
their expression. The detection of human IgM in the serum of
pIGM1 transgenics, and human IgM and IgG1 in pHC1 transgenics,
indicates that the transgene sequences function correctly in
directing VDJ joining, transcription, and isotype switching.

F. cDNA clones 

To assess the functionality of the pHC1 transgene in
VDJ joining and class switching, as well as the participation
of the transgene encoded human B-cell receptor in B-cell
development and allelic exclusion, the structure of
immunoglobulin cDNA clones derived from transgenic mouse
spleen mRNA was examined. The overall diversity of the
transgene encoded heavy chains, focusing on D and J segment
usage, N region addition, CDR3 length distribution, and the
frequency of joints resulting in functional mRNA molecules, was
examined. Transcripts encoding IgM and IgG incorporating
VH105 and VH251 were examined.

Polyadenylated RNA was isolated from an eleven week
old male second generation line-57 pHC1 transgenic mouse.
This RNA was used to synthesize oligo-dT primed single
stranded cDNA. The resulting cDNA was then used as template
for four individual PCR amplifications using the following
four synthetic oligonucleotides as primers: VH251 specific
oligo-149, cta gct cga gtc caa gga gtc tgt gcc gag gtg cag ctg
(g,a,t,c); VH105 specific o-150, gtt gct cga gtg aaa ggt gtc
cag tgt gag gtg cag ctg (g,a,t,c); human gamma1 specific
oligo-151, ggc gct cga gtt cca cga cac cgt cac cgg ttc; and
human mu specific oligo-152, cct gct cga ggc agc caa cgg cca
cgc tgc tcg. Reaction 1 used primers o-149 and o-151 to
amplify VH251-gamma1 transcripts, reaction 2 used o-149 and
o-152 to amplify VH251-mu transcripts, reaction 3 used o-150
and o-151 to amplify VH105-gamma1 transcripts, and reaction 4
used o-150 and o-152 to amplify VH105-mu transcripts. The
resulting 0.5 kb PCR products were isolated from an agarose
gel; the μ transcript products were more abundant than the γ
transcript products, consistent with the corresponding ELISA
data (Fig. 34). The PCR products were digested with XhoI and
cloned into the plasmid pNN03. Double-stranded plasmid DNA
was isolated from minipreps of nine clones from each of the
four PCR amplifications and dideoxy sequencing reactions were
performed. Two of the clones turned out to be deletions
containing no D or J segments. These could not have been
derived from normal RNA splicing products and are likely to
have originated from deletions introduced during PCR
amplification. One of the DNA samples turned out to be a
mixture of two individual clones, and three additional clones
did not produce readable DNA sequence (presumably because the
DNA samples were not clean enough). The DNA sequences of the
VDJ joints from the remaining 30 clones are compiled in Table
4. Each of the sequences is unique, indicating that no
single pathway of gene rearrangement, or single clone of
transgene expressing B-cells, is dominant. The fact that no
two sequences are alike is also an indication of the large
diversity of immunoglobulins that can be expressed from a
compact minilocus containing only 2 V segments, 10 D segments,
and 6 J segments. Both of the V segments, all six of the J
segments, and 7 of the 10 D segments that are included in the
transgene are used in VDJ joints. In addition, both constant
region genes (mu and gamma1) are incorporated into
transcripts. The VH105 primer turned out not to be specific
for VH105 in the reactions performed. Therefore many of the
clones from reactions 3 and 4 contained VH251 transcripts.
Additionally, clones isolated from ligated reaction 3 PCR
product turned out to encode IgM rather than IgG; however this
may reflect contamination with PCR product from reaction 4 as
the DNA was isolated on the same gel. An analogous
experiment, in which immunoglobulin heavy chain sequences were
amplified from adult human peripheral blood lymphocytes (PBL),
and the DNA sequence of the VDJ joints determined, was
recently reported by Yamada et al. (J. Exp. Med. 173:395-407
(1991), which is incorporated herein by reference). We
compared the data from human PBL with our data from the pHC1
transgenic mouse. 
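The bookkeeping behind the four amplifications can be laid out as a short sketch. It assumes the primer sequences read with the XhoI recognition sequence ctcgag near the 5' end (the reading consistent with the XhoI digestion of the PCR products), and it omits the degenerate final base written as (g,a,t,c) for the two VH primers. The dictionary and variable names are illustrative only.

```python
from itertools import product

XHOI_SITE = "ctcgag"

# Primer name -> (target, sequence). Sequences are an assumed reading of the
# printed text; the degenerate 3' base of o-149 and o-150 is left out.
PRIMERS = {
    "o-149": ("VH251", "cta gct cga gtc caa gga gtc tgt gcc gag gtg cag ctg"),
    "o-150": ("VH105", "gtt gct cga gtg aaa ggt gtc cag tgt gag gtg cag ctg"),
    "o-151": ("gamma1", "ggc gct cga gtt cca cga cac cgt cac cgg ttc"),
    "o-152": ("mu", "cct gct cga ggc agc caa cgg cca cgc tgc tcg"),
}
PRIMERS = {name: (target, seq.replace(" ", ""))
           for name, (target, seq) in PRIMERS.items()}

# Every V primer is paired with every constant region primer, giving the
# four reactions in the order described in the text.
REACTIONS = list(product(["o-149", "o-150"], ["o-151", "o-152"]))

# Clone accounting: nine clones per reaction, minus the clones set aside.
clones_picked = 9 * len(REACTIONS)
clones_excluded = 2 + 1 + 3  # deletions, mixed sample, unreadable sequence
clones_analyzed = clones_picked - clones_excluded

if __name__ == "__main__":
    for number, (v_primer, c_primer) in enumerate(REACTIONS, start=1):
        v_target = PRIMERS[v_primer][0]
        c_target = PRIMERS[c_primer][0]
        print(f"reaction {number}: {v_primer} + {c_primer} -> {v_target}-{c_target}")
    for name, (_, seq) in PRIMERS.items():
        print(name, "XhoI site at offset", seq.find(XHOI_SITE))
    print("clones analyzed:", clones_analyzed)
```

Placing the XhoI site in all four primers means every product carries the site at both ends, so a single XhoI digestion prepares each product for cloning.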
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G. J segment: choice 

Table 5 compares the distribution of J segments
incorporated into pHC1 transgene encoded transcripts to J
segments found in adult human PBL immunoglobulin transcripts.
The distribution profiles are very similar: J4 is the dominant
segment in both systems, followed by J6. J2 is the least
common segment in human PBL and the transgenic animal.

TABLE 5    J Segment Choice

                  Percent Usage (± 3%)
J Segment     HC1 transgenic     Human PBL

J1                  7                 1
J2                  3                <1
J3                 17                 9
J4                 44                53
J5                  3                15
J6                 26                22

                  100%              100%
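The comparison drawn from Table 5 can be reproduced with a few lines of code. This is a sketch under one stated assumption: the entry given as <1 for J2 in human PBL is represented by the placeholder value 0.5, which is not a measured figure. The function and variable names are illustrative.

```python
# Percent usage from Table 5 as (HC1 transgenic, human PBL).
# ASSUMPTION: the "<1" entry for J2 in human PBL is stored as 0.5.
J_USAGE = {
    "J1": (7, 1),
    "J2": (3, 0.5),
    "J3": (17, 9),
    "J4": (44, 53),
    "J5": (3, 15),
    "J6": (26, 22),
}


def ranked_segments(column: int) -> list:
    """Return segment names sorted from most to least used in one column."""
    return sorted(J_USAGE, key=lambda name: J_USAGE[name][column], reverse=True)


def largest_difference() -> str:
    """Return the segment whose usage differs most between the two columns."""
    return max(J_USAGE, key=lambda name: abs(J_USAGE[name][0] - J_USAGE[name][1]))


if __name__ == "__main__":
    print("transgenic ranking:", ranked_segments(0))
    print("human PBL ranking: ", ranked_segments(1))
    print("largest difference:", largest_difference())
```

The two leading segments come out in the same order in both columns, which is the similarity the text points to; the segment with the largest gap between the columns is also easy to read off from the same data.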



H. D segment choice

49% (40 of 82) of the clones analyzed by Yamada et
al. incorporated D segments that are included in the pHC1
transgene. An additional 11 clones contained sequences that
were not assigned by the authors to any of the known D
segments. Two of these 11 unassigned clones appear to be
derived from an inversion of the DIR2 segment, which is
included in the pHC1 construct. This mechanism, which was
predicted by Ichihara et al. (EMBO J. 7:4141 (1988)) and
observed by Sanz (J. Immunol. 147:1720-1729 (1991)), was not
considered by Yamada et al. (J. Exp. Med. 173:395-407 (1991)).
Table 6 is a comparison of the D segment distribution for the
pHC1 transgenic mouse and that observed for human PBL
transcripts by Yamada et al. The data of Yamada et al. were
recompiled to include DIR2 use, and to exclude D segments that
are not in the pHC1 transgene. Table 6 demonstrates that the
distribution of D segment incorporation is very similar in the
transgenic mouse and in human PBL. The two dominant human D
segments, DXP'1 and DN1, are also found with high frequency in
the transgenic mouse. The most dramatic dissimilarity between
the two distributions is the high frequency of DHQ52 in the
transgenic mouse as compared to the human. The high frequency
of DHQ52 is reminiscent of the D segment distribution in the
human fetal liver. Sanz has observed that 14% of the heavy
chain transcripts contained DHQ52 sequences. If D segments
not found in pHC1 are excluded from the analysis, 31% of the
fetal transcripts analyzed by Sanz contain DHQ52. This is
comparable to the 27% that we observe in the pHC1 transgenic
mouse.

TABLE 6    D Segment Choice

                  Percent Usage (± 3%)
D Segment     HC1 transgenic     Human PBL



25 



<1 



<1 



15 " DLR1 ^ 6 

DAI ^ 12 

DK1 ; 22 

20 DN1 " 4 

DIR2 ( 2 

DM2 <: t 4 

DLR2 I 2 

DHQ52 ?7 -12- 



100% 100% 



I. Functionality of VDJ joints

Table 7 shows the predicted amino acid sequences of
the VDJ regions from 30 clones that were analyzed from the
pHC1 transgenic. The translated sequences indicate that 23 of
the 30 VDJ joints (77%) are in-frame with respect to the
variable and J segments.
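The percentage quoted for in-frame joints follows directly from the clone counts. The sketch below reproduces it and, as a point of comparison only, sets it against the one-in-three fraction that would be expected if junction lengths were random with respect to reading frame. That baseline is an assumption introduced here for illustration and is not a figure taken from the text.

```python
from fractions import Fraction

IN_FRAME_JOINTS = 23
TOTAL_JOINTS = 30

# Observed fraction of VDJ joints that preserve the reading frame.
observed = Fraction(IN_FRAME_JOINTS, TOTAL_JOINTS)

# ASSUMPTION: with junction lengths random modulo 3, one joint in three
# would keep the V and J segments in the same reading frame.
random_baseline = Fraction(1, 3)

if __name__ == "__main__":
    print(f"observed in-frame: {float(observed):.1%}")
    print(f"random baseline:   {float(random_baseline):.1%}")
    print("enriched over baseline:", observed > random_baseline)
```

An in-frame fraction well above one third is what would be expected if cells carrying productive rearrangements are favored in the sampled mRNA population.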
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Functionality of v-d-J Joints 






FR3 CDR3 



FR4 



1 YH251 359052 J2 *l 

2 VH251 CHI -?4 ?» 

3 VH251 2? JS ri 

4 VH251 CXP'l Jo M 

5 VH251 SXP'l Ji Tl 
€ VH25I C? -J3 ft 
1 VH251 DBQS2 J3 it 

6 VH251 SaQ52 JS a 

9 YH251 - JI » 

10 VBSI CLR2 J4 » 
:i VH251 CXP*1 J4 it 
12 VB291 O? Jl » 
U VK251 0BQS2 JS it 

;4 VBZ5X oocrn js it 

15 VBSI aori -4 ti 

15 VHIOS 0JCT1 JS it 

1? VH251 OXF'l J4 ji 

18 VH251 DHQS2 J4 * 

19 VBSI Cad JS 7t 
23 VB2S1 DHQS2 J4 it 

n vbsx na J2 71

Table 8 compares the length of the CDR3 peptides
from transcripts with in-frame VDJ joints in the pHC1
transgenic mouse to those in human PBL. Again the human PBL
data comes from Yamada et al. The profiles are similar, with
the transgenic profile skewed slightly toward smaller CDR3
peptides than observed from human PBL. The average length of
CDR3 in the transgenic mouse is 10.3 amino acids. This is
substantially the same as the average size reported for
authentic human CDR3 peptides by Sanz (J. Immunol. 147:1720-
1729 (1991)).
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CDR3 Length Distribution 



                       Percent occurrence (± 3%)
#amino acids
in CDR3            HC1 transgenic     Human PBL

3-8                      26               14
9-12                     48               41
13-18                    26               37
19-23                    <1                7
>23                      <1                1

                        100%             100%



EXAMPLE 13

Two human leukocyte genomic DNA libraries cloned
mixture of the following VH specific oligonucleotides:

oligo-7    5'-tca gtg aag gtt tcc tgc aag gca tct gga tac acc
           ttc acc-3'

oligo-8    5'-tcc ctg aga ctc tcc tgt gca gcc tct gga ttc acc
           ttc ac

Clones that hybridized with both V and J-μ probes
are isolated and the DNA sequence of the rearranged VDJ
segment determined.

B. Construction of rearranged human heavy chain transgenes

Fragments containing functional VDJ segments (open
reading frame and splice signals) are subcloned into the
plasmid vector pSP72 such that the plasmid derived XhoI site
is adjacent to the 5' end of the insert sequence. A subclone
containing a functional VDJ segment is digested with XhoI and
PacI (PacI, a rare-cutting enzyme, recognizes a site near the
J-μ intronic enhancer), and the insert cloned into XhoI/PacI
digested pHC2 to generate a transgene construct with a
functional VDJ segment, the J-μ intronic enhancer, the μ
switch element, the μ constant region coding exons, and the γ1
constant region, including the sterile transcript associated
sequences, the γ1 switch, and the coding exons. This
transgene construct is excised with NotI and microinjected
into the pronuclei of mouse embryos to generate transgenic
animals as described above.

EXAMPLE 14

Light Chain Transgenes

A. Construction of Plasmid vectors

1. Plasmid vector pGP1c

Plasmid vector pGP1a is digested with NotI and the
following oligonucleotides ligated in:

oligo-81 5'-ggc cgc atc ccg ggt ctc gag gtc gac aag ctt tcg
agg atc cgc-3'

oligo-82 5'-ggc cgc gga tcc tcg aaa gct tgt cga cct cga gac
ccg gga tgc-3'

The resulting plasmid, pGP1c, contains a polylinker with XmaI,
XhoI, SalI, HindIII, and BamHI restriction sites flanked by
NotI sites.
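Whether two polylinker oligonucleotides form a duplex with the right cohesive ends can be checked from their sequences alone. The sketch below assumes oligo-81 and oligo-82 read as ggc cgc atc ccg ggt ctc gag gtc gac aag ctt tcg agg atc cgc and ggc cgc gga tcc tcg aaa gct tgt cga cct cga gac ccg gga tgc; the function names are illustrative. It tests that the two strands are complementary apart from a four-base 5' extension on each, and reports that extension, which should match the 5'-GGCC overhang left by NotI.

```python
# Watson-Crick complement table for lowercase DNA.
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def cohesive_overhang(top: str, bottom: str, length: int):
    """Return the shared 5' overhang of an annealed oligo pair, or None.

    Both oligos are written 5'->3'. The pair anneals with 5' overhangs of
    the given length when everything after the overhang on one strand is
    the reverse complement of everything after the overhang on the other.
    """
    paired = top[length:] == reverse_complement(bottom[length:])
    same_end = top[:length] == bottom[:length]
    return top[:length] if paired and same_end else None


# Assumed readings of the two polylinker oligonucleotides.
OLIGO_81 = "ggc cgc atc ccg ggt ctc gag gtc gac aag ctt tcg agg atc cgc".replace(" ", "")
OLIGO_82 = "ggc cgc gga tcc tcg aaa gct tgt cga cct cga gac ccg gga tgc".replace(" ", "")

if __name__ == "__main__":
    print("5' overhang:", cohesive_overhang(OLIGO_81, OLIGO_82, 4))
    # Ligation to a NotI-cut end (...gc) restores gcggccgc on each side.
    print("NotI site restored:", ("gc" + OLIGO_81).startswith("gcggccgc"))
```

Since both oligonucleotides begin with ggccgc, ligation into the NotI-cut vector regenerates a NotI site on each side of the polylinker, in line with the statement that the sites are flanked by NotI sites.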
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2. Plasmid vector pGPld 

Plasmid vector pGP1a is digested with NotI and the
following oligonucleotides ligated in:

oligo-87 5'-ggc cgc tgt cga caa gct tat cga tgg atc ctc gag
tgc -3'

oligo-88 5'-ggc cgc act cga gga tcc atc gat aag ctt gtc gac
agc -3'

The resulting plasmid, pGP1d, contains a polylinker with SalI,
HindIII, ClaI, BamHI, and XhoI restriction sites flanked by
NotI sites.
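The order of restriction sites stated for this polylinker can be read off the oligonucleotide sequence. The following sketch assumes oligo-87 reads ggc cgc tgt cga caa gct tat cga tgg atc ctc gag tgc and uses the standard recognition sequences of the five enzymes; the function name is illustrative. It lists the sites in 5' to 3' order along the oligonucleotide.

```python
# Standard recognition sequences for the enzymes named in the text.
RECOGNITION_SITES = {
    "SalI": "gtcgac",
    "HindIII": "aagctt",
    "ClaI": "atcgat",
    "BamHI": "ggatcc",
    "XhoI": "ctcgag",
}

# Assumed reading of oligo-87.
OLIGO_87 = "ggc cgc tgt cga caa gct tat cga tgg atc ctc gag tgc".replace(" ", "")


def site_order(seq: str, sites: dict) -> list:
    """Return the names of the sites found in seq, ordered by position."""
    hits = [(seq.find(motif), name)
            for name, motif in sites.items() if motif in seq]
    return [name for _, name in sorted(hits)]


if __name__ == "__main__":
    for name in site_order(OLIGO_87, RECOGNITION_SITES):
        offset = OLIGO_87.find(RECOGNITION_SITES[name])
        print(f"{name:8s} at offset {offset}")
```

The same function can be applied to any of the other polylinker oligonucleotides in this Example to confirm the site order given for the corresponding vector.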

B. Isolation of Jκ and Cκ clones

A human placental genomic DNA library cloned into
the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories, Inc.,
Palo Alto, CA) was screened with the human kappa light chain J
region specific oligonucleotide:

oligo-36 5'- cac ctt cgg cca agg gac acg act gga gat taa acg
taa gca-3'

and the phage clones λ36.2 and λ36.5 isolated. A 7.4 kb XhoI
fragment that includes the Jκ1 segment was isolated from
λ36.2 and subcloned into the plasmid pNN03 to generate the
plasmid clone p36.2. A neighboring 13 kb XhoI fragment that
includes Jκ segments 2 through 5 together with the Cκ gene
segment was isolated from phage clone λ36.5 and subcloned into
the plasmid pNN03 to generate the plasmid clone p36.5.
Together these two clones span the region beginning 7.2 kb
upstream of Jκ1 and ending 9 kb downstream of Cκ.

C. Construction of rearranged light chain transgenes

1. pCK1, a Cκ vector for expressing rearranged variable
segments

The 13 kb XhoI insert of plasmid clone p36.5
containing the Cκ gene, together with 9 kb of downstream
sequences, is cloned into the SalI site of plasmid vector
pGP1c with the 5' end of the insert adjacent to the plasmid
XhoI site. The resulting clone, pCK1, can accept cloned
fragments containing rearranged VJκ segments into the unique
5' XhoI site. The transgene can then be excised with NotI and
purified from vector sequences by gel electrophoresis. The
resulting transgene construct will contain the human J-Cκ
intronic enhancer and may contain the human 3' κ enhancer.

2. pCK2, a Cκ vector with heavy chain enhancers for
expressing rearranged variable segments

A 0.9 kb XbaI fragment of mouse genomic DNA
containing the mouse heavy chain J-μ intronic enhancer (J.
Banerji et al., Cell 33:729-740 (1983)) was subcloned into
pUC18 to generate the plasmid pJH22.1. This plasmid was
linearized with SphI and the ends filled in with Klenow
enzyme. The Klenow treated DNA was then digested with HindIII
and a 1.4 kb MluI/HindIII fragment of phage clone λ1.3
(previous example), containing the human heavy chain J-μ
intronic enhancer (Hayday et al., Nature 307:334-340 (1984)),
was ligated to it. The resulting plasmid, pMHE1, consists of
the mouse and human heavy chain J-μ intronic enhancers ligated
together into pUC18 such that they are excised on a single
BamHI/HindIII fragment. This 2.3 kb fragment is isolated and
cloned into pGP1c to generate pMHE2. pMHE2 is digested with
SalI and the 13 kb XhoI insert of p36.5 cloned in. The
resulting plasmid, pCK2, is identical to pCK1, except that the
mouse and human heavy chain J-μ intronic enhancers are fused
to the 3' end of the transgene insert. To modulate expression
of the final transgene, analogous constructs can be generated
with different enhancers, e.g. the mouse or rat 3' kappa or
heavy chain enhancer (Meyer and Neuberger, EMBO J.,
8:1959-1964 (1989); Pettersson et al., Nature 344:165-168
(1990)).






3. Isolation of rearranged kappa light chain variable
segments

Two human leukocyte genomic DNA libraries cloned
into the phage vector λEMBL3/SP6/T7 (Clonetech Laboratories,
Inc., Palo Alto, CA) were screened with the human kappa light
chain J region containing 3.5 kb XhoI/SmaI fragment of p36.5.
Positive clones were tested for hybridization with the
following Vκ specific oligonucleotide:

oligo-65 5'-cgg ttc agt ggc agt ggg tct ggg aca gac ttc act
ctc acc atc agc-3'

Clones that hybridized with both V and J probes are isolated
and the DNA sequence of the rearranged VJκ segment determined.

4. Generation of transgenic mice containing rearranged human
light chain constructs

Fragments containing functional VJ segments (open
reading frame and splice signals) are subcloned into the
unique XhoI sites of vectors pCK1 and pCK2 to generate
rearranged kappa light chain transgenes. The transgene
constructs are isolated from vector sequences by digestion
with NotI. Agarose gel purified insert is microinjected into
mouse embryo pronuclei to generate transgenic animals.
Animals expressing human kappa chain are bred with heavy chain
minilocus containing transgenic animals to generate mice
expressing fully human antibodies.

Because not all VJκ combinations may be capable of
forming stable heavy-light chain complexes with a broad
spectrum of different heavy chain VDJ combinations, several
different light chain transgene constructs are generated, each
using a different rearranged VJκ clone, and transgenic mice
that result from these constructs are bred with heavy chain
minilocus transgene expressing mice. Peripheral blood,
spleen, and lymph node lymphocytes are isolated from double
transgenic (both heavy and light chain constructs) animals,
stained with fluorescent antibodies specific for human and
mouse heavy and light chain immunoglobulins (Pharmingen, San
Diego, CA) and analyzed by flow cytometry using a FACScan
analyzer (Becton Dickinson, San Jose, CA). Rearranged light
chain transgene constructs that result in the highest level
of human heavy/light chain complexes on the surface of the
highest number of B cells, and do not adversely affect the
immune cell compartment (as assayed by flow cytometric
analysis with B and T cell subset specific antibodies), are
selected for the generation of human monoclonal antibodies.

D. Construction of unrearranged light chain minilocus
transgenes

1. pJCK1, a Jκ, Cκ containing vector for constructing
minilocus transgenes

The 13 kb Cκ containing XhoI insert of p36.5 is
treated with Klenow enzyme and cloned into HindIII digested,
Klenow-treated, plasmid pGP1d. A plasmid clone is selected
such that the 5' end of the insert is adjacent to the vector
derived ClaI site. The resulting plasmid, p36.5-1d, is
digested with ClaI and Klenow-treated. The Jκ1 containing 7.4
kb XhoI insert of p36.2 is then Klenow-treated and cloned into
the ClaI, Klenow-treated p36.5-1d. A clone is selected in
which the p36.2 insert is in the same orientation as the p36.5
insert. This clone, pJCK1 (Fig. 34), contains the entire
human Jκ region and Cκ, together with 7.2 kb of upstream
sequences and 9 kb of downstream sequences. The insert also
contains the human J-Cκ intronic enhancer and may contain a
human 3' κ enhancer. The insert is flanked by a unique 3'
SalI site for the purpose of cloning additional 3' flanking
sequences such as heavy chain or light chain enhancers. A
unique XhoI site is located at the 5' end of the insert for
the purpose of cloning in unrearranged Vκ gene segments. The
unique SalI and XhoI sites are in turn flanked by NotI sites
that are used to isolate the completed transgene construct
away from vector sequences.



2. Isolation of unrearranged Vκ gene segments and generation
of transgenic animals expressing human Ig light chain protein

The Vκ specific oligonucleotide, oligo-65 (discussed
above), is used to probe a human placental genomic DNA library
cloned into the phage vector λEMBL3/SP6/T7 (Clonetech
Laboratories, Inc., Palo Alto, CA). Variable gene segments
from the resulting clones are sequenced, and clones that
appear functional are selected. Criteria for judging
functionality include: open reading frames, intact splice
acceptor and donor sequences, and intact recombination
sequence. DNA fragments containing selected variable gene
segments are cloned into the unique XhoI site of plasmid pJCK1
to generate minilocus constructs. The resulting clones are
digested with NotI and the inserts isolated and injected into
mouse embryo pronuclei to generate transgenic animals. The
transgenes of these animals will undergo V to J joining in
developing B-cells. Animals expressing human kappa chain are
bred with heavy chain minilocus containing transgenic animals
to generate mice expressing fully human antibodies.

EXAMPLE 15 
Genomic, Hp^t gal" To Transgene 

This Example describes the cloning of a human 
genomic heavy chain immunoglobulin transgene which is then 
introduced into the murine germline via microinjection xnto 
zygotes or integration in ES cells. 

Nuclei are isolated from fresh human placental
tissue as described by Marzluff, W.F., et al. (1985),
Transcription and Translation: A Practical Approach (B.D.
Hames and S.J. Higgins, eds., pp. 89-129, IRL Press, Oxford).
The isolated nuclei (or PBS washed human spermatocytes) are
embedded in 0.5% low melting point agarose blocks and lysed
with 1 mg/ml proteinase K in 500 mM EDTA, 1% SDS for nuclei, or
with 1 mg/ml proteinase K in 500 mM EDTA, 1% SDS, 10 mM DTT for
spermatocytes at 50°C for 18 hours. The proteinase K is
inactivated by incubating the blocks in 40 µg/ml PMSF in TE for
30 minutes at 50°C, and then washing extensively with TE. The
DNA is then digested in the agarose with the restriction
enzyme NotI as described by M. Finney in Current Protocols in
Molecular Biology (F. Ausubel et al., eds. John Wiley & Sons,
Supp. 4, 1988, e.g., Section 2.5.1).

The NotI digested DNA is then fractionated by pulsed
field gel electrophoresis as described by Anand et al., Nucl.
Acids Res. 17:3425-3433 (1989). Fractions enriched for the
NotI fragment are assayed by Southern hybridization to detect
one or more of the sequences encoded by this fragment. Such
sequences include the heavy chain D segments, J segments, and
γ1 constant regions together with representatives of all 6 VH
families (although this fragment is identified as a 670 kb
fragment from HeLa cells by Berman et al. (1988), supra, we
have found it to be an 830 kb fragment from human placental
and sperm DNA). Those fractions containing this NotI
fragment are ligated into the NotI cloning site of the vector
pYACNN as described (McCormick et al., Technique 2:65-71
(1990)). Plasmid pYACNN is prepared by digestion of pYACneo
(Clontech) with EcoRI and ligation in the presence of the
oligonucleotide 5'-AAT TGC GGC CGC-3'.
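
The effect of the linker oligonucleotide on the EcoRI site can be worked through as an illustrative sketch (it is not part of the described procedure). The sketch assumes the standard recognition sequences EcoRI G^AATTC and NotI GCGGCCGC, and assumes that two copies of the oligonucleotide anneal through their self-complementary cores; all function and variable names are hypothetical.

```python
# Sketch: ligating 5'-AATTGCGGCCGC-3' into an EcoRI-cut vector.
# Assumptions: EcoRI cuts G^AATTC (5' AATT overhang); NotI recognizes GCGGCCGC.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


LINKER = "AATTGCGGCCGC"
ECORI = "GAATTC"
NOTI = "GCGGCCGC"

overhang, core = LINKER[:4], LINKER[4:]

# The core is palindromic, so two copies of the oligonucleotide can anneal to
# each other, leaving a 5' AATT overhang at each end of the duplex.
core_is_self_complementary = core == revcomp(core)

# Top strand across the ligated region:
# vector "G" (left EcoRI half) + linker + vector "AATTC" (right EcoRI half).
junction = "G" + LINKER + "AATTC"

print("core self-complementary:", core_is_self_complementary)
print("NotI site present:", NOTI in junction)
print("EcoRI site regenerated:", ECORI in junction)
```

Under these assumptions the junction carries a NotI site and neither flank regenerates EcoRI, which is consistent with pYACNN offering a NotI cloning site.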

YAC clones containing the heavy chain NotI fragment
are isolated as described by Traver et al., Proc. Natl. Acad.
Sci. USA 86:5898-5902 (1989). The cloned NotI insert is
isolated from high molecular weight yeast DNA by pulsed field
gel electrophoresis as described by M. Finney, op. cit. The
DNA is condensed by the addition of 1 mM spermine and
microinjected directly into the nucleus of single cell embryos
as previously described. Alternatively, the DNA is isolated by
pulsed field gel electrophoresis and introduced into ES cells
by lipofection (Gnirke et al., EMBO J. 10:1629-1634 (1991)),
or the YAC is introduced into ES cells by spheroplast fusion.

EXAMPLE 16 

Discontinuous Genomic Heavy Chain Ig Transgene

An 85 kb SpeI fragment of human genomic DNA,
containing VH6, D segments, J segments, the μ constant region
and part of the γ constant region, has been isolated by YAC
cloning essentially as described in Example 1. A YAC carrying
a fragment from the germline variable region, such as a 570 kb
NotI fragment upstream of the 670-830 kb NotI fragment
described above containing multiple copies of V1 through V5, is
isolated as described. (Berman et al. (1988), supra, detected
two 570 kb NotI fragments, each containing multiple V
segments.) The two fragments are coinjected into the nucleus
of a mouse single cell embryo as described in Example 1.

Typically, coinjection of two different DNA
fragments results in the integration of both fragments at the
same insertion site within the chromosome. Therefore,
approximately 50% of the resulting transgenic animals that
contain at least one copy of each of the two fragments will
have the V segment fragment inserted upstream of the constant
region containing fragment. Of these animals, about 50% will
carry out V to DJ joining by DNA inversion and about 50% by
deletion, depending on the orientation of the 570 kb NotI
fragment relative to the position of the 85 kb SpeI fragment.
DNA is isolated from resultant transgenic animals and those
animals found to be containing both transgenes by Southern
blot hybridization (specifically, those animals containing
both multiple human V segments and human constant region
genes) are tested for their ability to express human
immunoglobulin molecules in accordance with standard
techniques.
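
The approximate proportions given above can be tabulated with a short sketch. It assumes that the relative order and the relative orientation of the two co-integrated fragments are independent and equally likely (an idealization of the "approximately 50%" figures), and that fragments in the same orientation join by deletion while inverted fragments join by inversion. The names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Assumption: each order/orientation combination is equally likely.
ORDERS = ("V_upstream", "V_downstream")
ORIENTATIONS = ("same", "inverted")


def joining_outcomes() -> dict:
    """Tabulate the expected fraction of double-transgenic animals per outcome."""
    weight = Fraction(1, len(ORDERS) * len(ORIENTATIONS))
    table = {}
    for order, orientation in product(ORDERS, ORIENTATIONS):
        if order != "V_upstream":
            mode = "V_fragment_not_upstream"
        elif orientation == "same":
            mode = "deletion"
        else:
            mode = "inversion"
        table[mode] = table.get(mode, Fraction(0)) + weight
    return table


outcomes = joining_outcomes()
for mode, fraction in sorted(outcomes.items()):
    print(f"{mode}: {float(fraction):.0%}")
```

On these assumptions roughly a quarter of the animals carrying both fragments would be expected to join by deletion and a quarter by inversion.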

EXAMPLE 17 

Identification of functionally rearranged variable region
sequences in transgenic B cells

An antigen of interest is used to immunize (see
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring
Harbor, New York (1988)) a mouse with the following genetic
traits: homozygosity at the endogenous heavy chain locus for
a deletion of JH (Example 10); hemizygous for a single copy
of unrearranged human heavy chain minilocus transgene
(Examples 5 and 14); and hemizygous for a single copy of a
rearranged human kappa light chain transgene (Examples 6 and 

Following the schedule of immunization, the spleen 
is removed, and spleen cells used to generate hybridomas.
Cells from an individual hybridoma clone that secretes
antibodies reactive with the antigen of interest are used to 
prepare genomic DNA. A sample of the genomic DNA is digested 
with several different restriction enzymes that recognize 
unique six base pair sequences, and fractionated on an agarose 
gel. Southern blot hybridization is used to identify two DNA 
fragments in the 2-10 kb range, one of which contains the 
single copy of the rearranged human heavy chain VDJ sequences 
and one of which contains the single copy of the rearranged 
human light chain VJ sequence. These two fragments are size 
fractionated on agarose gel and cloned directly into pUC18. 
The cloned inserts are then subcloned respectively into heavy 
and light chain expression cassettes that contain constant 
region sequences. 
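
The choice of enzymes with six base pair recognition sequences fits the 2-10 kb size range, as the following rough estimate suggests. The sketch assumes a random sequence with equal base frequencies, which genomic DNA only approximates; the function name is hypothetical.

```python
def expected_site_spacing(site_length: int, alphabet_size: int = 4) -> int:
    """Average distance in bp between occurrences of one specific site,
    assuming random sequence with equal base frequencies."""
    return alphabet_size ** site_length


spacing_bp = expected_site_spacing(6)
within_window = 2_000 <= spacing_bp <= 10_000

print(f"expected spacing for a 6 bp site: {spacing_bp} bp")
print(f"falls in the 2-10 kb window: {within_window}")
```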

The plasmid clone pγe1 (Example 12) is used as a
heavy chain expression cassette and rearranged VDJ sequences
are cloned into the XhoI site. The plasmid clone pCK1 is used
as a light chain expression cassette and rearranged VJ
sequences are cloned into the XhoI site. The resulting clones
are used together to transfect SP0 cells to produce antibodies
that react with the antigen of interest (Co et al., Proc.
Natl. Acad. Sci. USA 88:2869 (1991), which is incorporated
herein by reference).

Alternatively, mRNA is isolated from the cloned 
hybridoma cells described above, and used to synthesize cDNA. 
The expressed human heavy and light chain VDJ and VJ sequences
are then amplified by PCR and cloned (Larrick et al.,
Bio/Technology 7:934-938 (1989)). After the nucleotide sequence
of these clones has been determined, oligonucleotides are
synthesized that encode the same polypeptides, and synthetic
expression vectors generated as described by Queen et al.,
Proc. Natl. Acad. Sci. USA 84:5454-5458 (1989).

Immunization of Transgenic Animals with Complex Antigens

The following experiment demonstrates that 
transgenic animals can be successfully immunized with complex 
antigens such as those on human red blood cells and respond
with kinetics that are similar to the response kinetics
observed in normal mice.

Blood cells generally are suitable immunogens and 
comprise many different types of antigens on the surface of 
red and white blood cells. 

Immunization with human blood

Tubes of human blood from a single donor were
collected and used to immunize transgenic mice having
functionally disrupted endogenous heavy chain loci (V) and
harboring a human heavy chain minigene construct (HC1); these
mice are designated as line 112. Blood was washed and
resuspended in 50 mls Hanks' and diluted to 1 x 10^8 cells/ml.
0.2 mls (2 x 10^7 cells) were then injected intraperitoneally
using a 28 gauge needle and 1 cc syringe. This immunization
protocol was repeated approximately weekly for 6 weeks. Serum
titers were monitored by taking blood from retro-orbital bleeds
and collecting serum and later testing for specific antibody. A
pre-immune bleed was also taken as a control. On the very
last immunization, three days before these animals were
sacrificed for serum and for hybridomas, a single immunization
of 1 x 10^7 cells was given intravenously through the tail to
enhance the production of hybridomas.



Table 9. Animals.

Mice # 2343 and 2348 have a desired phenotype: human heavy
chain mini-gene transgenic on heavy chain knock-out
background.

Generation of Hybridomas

Hybridomas were generated by fusing mouse spleen
cells of approximately 16 week-old transgenic mice (Table 9)
that had been immunized as described (supra) to a fusion
partner consisting of the non-secreting HAT-sensitive myeloma
cell line, X63 Ag8.653. Hybridoma clones were cultivated and
hybridoma supernatants containing immunoglobulins having
specific binding affinity for blood cell antigens were
identified, for example, by flow cytometry.

Flow cytometry

Serum and hybridoma supernatants were tested using
flow cytometry. Red blood cells from the donor were washed 4X
in Hanks' balanced salt solution and 50,000 cells were placed
in 1.1 ml polypropylene microtubes. Cells were incubated with
antisera or supernatant from the hybridomas for 30 minutes on
ice in staining media (1x RPMI 1640 media without phenol red
or biotin (Irvine Scientific), 3% newborn calf serum, 0.1% Na
azide). Controls consisted of littermate mice with other
genotypes. Cells were then washed by centrifugation at 4°C in a
Sorvall RT600B for 5-10 minutes at 1000 rpm. Cells were
washed two times and then antibody was detected on the cell
surface with a fluorescent developing reagent. Two monoclonal
reagents were used to test. One was a FITC-labeled mouse
anti-human μ heavy chain antibody (Pharmingen, San Diego, CA)
and the other was a PE-labeled rat anti-mouse kappa light
chain (Becton-Dickinson, San Jose, CA). Both of these
reagents gave similar results. Whole blood (red blood cells
and white blood cells) and white blood cells alone were used
as target cells. Both sets gave positive results.

Serum of transgenic mice and littermate controls was
incubated with either red blood cells from the donor, or white
blood cells from another individual, washed and then developed
with anti-human IgM FITC labeled antibody and analyzed in a
flow cytometer. Results showed that serum from mice that are
transgenic for the human mini-gene locus (mice 2343 and 2348)
show human IgM reactivity whereas all littermate animals
(2344, 2345, 2346, 2347) do not. Normal mouse serum (NS) and
phosphate buffered saline (PBS) were used as negative controls.
Red blood cells were ungated and white blood cells were gated
to include only lymphocytes. Lines are drawn on the x and y
axis to provide a reference. Flow cytometry was performed on
100 supernatants from fusion 2348. Four supernatants showed
positive reactivity for blood cell antigens.

EXAMPLE 18 

Reduction of Endogenous Mouse Immunoglobulin Expression
by Antisense RNA

A. Vector for Expression of Antisense Ig Sequences
1. Construction of the cloning vector pGP1h
The vector pGP1b (referred to in a previous example)
is digested with XhoI and BamHI and ligated with the following
oligonucleotides:

5'- gat cct cga gac cag gta cca gat ctt gtg aat tcg -3'
5'- tcg acg aat tca caa gat ctg gta cct ggt ctc gag -3'

to generate the plasmid pGP1h. This plasmid contains a
polylinker that includes the following restriction sites:
NotI, EcoRI, BglII, Asp718, XhoI, BamHI, HindIII, NotI.
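
The pairing of the two polylinker oligonucleotides and the sites they carry can be checked with a short sketch. The oligonucleotide sequences are transcribed here as lowercase strings, which is an assumption of the sketch, as are the standard recognition sequences used (XhoI CTCGAG, Asp718 GGTACC, BglII AGATCT, EcoRI GAATTC) and the hypothetical function names.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Oligonucleotides as transcribed for this sketch (assumption).
TOP = "gatcctcgagaccaggtaccagatcttgtgaattcg"
BOTTOM = "tcgacgaattcacaagatctggtacctggtctcgag"

SITES = {
    "XhoI": "ctcgag",
    "Asp718": "ggtacc",
    "BglII": "agatct",
    "EcoRI": "gaattc",
}


def site_order(seq: str) -> list:
    """List the sites found in seq, ordered by first position."""
    found = [(seq.find(site), name) for name, site in SITES.items() if site in seq]
    return [name for _, name in sorted(found)]


# After removing each 4-nt 5' overhang the two strands should pair exactly.
duplex_core = TOP[4:]
strands_anneal = revcomp(BOTTOM)[: len(duplex_core)] == duplex_core

top_overhang = TOP[:4]        # gatc, compatible with a BamHI-cut end
bottom_overhang = BOTTOM[:4]  # tcga, compatible with an XhoI-cut end

print(strands_anneal, top_overhang, bottom_overhang)
print(site_order(TOP))
```

Read along the top strand from the BamHI-compatible end, the sites appear in the reverse of the EcoRI, BglII, Asp718, XhoI, BamHI order listed for the polylinker, as expected when the same segment is read from the opposite side.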

2. Construction of pBCE1.

A 0.8 kb XbaI/BglII fragment of pVH251 (referred to
in a previous example), that includes the promoter, leader
sequence exon, first intron, and part of the second exon of
the human VH-V family immunoglobulin variable gene segment,
was inserted into XbaI/BglII digested vector pNN03 to generate
the plasmid pVH251N.

The 2.2 kb BamHI/EcoRI DNA fragment that includes
the coding exons of the human growth hormone gene (hGH;
Seeburg, (1982) DNA 1:239-249) is cloned into BglII/EcoRI
digested pGP1h. The resulting plasmid is digested with BamHI
and the BamHI/BglII fragment of pVH251N is inserted in the same
orientation as the hGH gene to generate the plasmid pVhgh.

A 0.9 kb XbaI fragment of mouse genomic DNA
containing the mouse heavy chain J-μ intronic enhancer
(Banerji et al., (1983) Cell 33:729-740) was subcloned into
pUC18 to generate the plasmid pJH22.1. This plasmid was
linearized with SphI and the ends filled in with Klenow
enzyme. The Klenow treated DNA was then digested with HindIII
and a 1.4 kb MluI(Klenow)/HindIII fragment of phage clone λ1.3
(previous example), containing the human heavy chain J-μ
intronic enhancer (Hayday et al., (1984) Nature 307:334-340),
was ligated to it. The resulting plasmid, pMHE1, consists of
the mouse and human heavy chain J-μ intron enhancers ligated
together into pUC18 such that they can be excised on a single
BamHI/HindIII fragment.

The BamHI/HindIII fragment of pMHE1 is cloned into
BamHI/HindIII cut pVhgh to generate the B-cell expression
vector pBCE1. This vector, depicted in Fig. 36, contains
unique XhoI and Asp718 cloning sites into which antisense DNA
fragments can be cloned. The expression of these antisense
sequences is driven by the upstream heavy chain promoter-
enhancer combination; the downstream hGH gene sequences provide
polyadenylation sequences in addition to intron sequences that
promote the expression of transgene constructs. Antisense
transgene constructs generated from pBCE1 can be separated
from vector sequences by digestion with NotI.

B. An IgM antisense transgene construct.

The following two oligonucleotides:

5'- cgc ggt acc gag agt cag tcc ttc cca aat gtc -3'
5'- cgc ctc gag aca gct gga atg ggc aca tgc aga -3'

are used as primers for the amplification of mouse IgM
constant region sequences by polymerase chain reaction (PCR)
using mouse spleen cDNA as a substrate. The resulting 0.3 kb
PCR product is digested with Asp718 and XhoI and cloned into
Asp718/XhoI digested pBCE1 to generate the antisense transgene
construct pMAS1. The purified NotI insert of pMAS1 is
microinjected into the pronuclei of half day mouse embryos,
alone or in combination with one or more other transgene
constructs, to generate transgenic mice. This construct
expresses an RNA transcript in B-cells that hybridizes with
mouse IgM mRNA, thus down-regulating the expression of mouse
IgM protein. Double transgenic mice containing pMAS1 and a
human heavy chain transgene minilocus such as pHC1 (generated
either by coinjection of both constructs or by breeding of
singly transgenic mice) will express the human transgene
encoded Ig receptor on a higher percentage of B-cells than mice
transgenic for the human heavy chain minilocus alone. The
ratio of human to mouse Ig receptor expressing cells is due in
part to competition between the two populations for factors
and cells that promote B-cell differentiation and expansion.
Because the Ig receptor plays a key role in B-cell
development, mouse Ig receptor expressing B-cells that express
reduced levels of IgM on their surface (due to mouse Ig
specific antisense down-regulation) during B-cell development
will not compete as well as cells that express the human
receptor.

C. An IgKappa antisense transgene construct.

The following two oligonucleotides:

5'- cgc ggt acc gct gat gct gca cca act gta tcc -3'
5'- cgc ctc gag cta aca ctc att cct gtt gaa gct -3'

are used as primers for the amplification of mouse IgKappa
constant region sequences by polymerase chain reaction (PCR)
using mouse spleen cDNA as a substrate. The resulting 0.3 kb
PCR product is digested with Asp718 and XhoI and cloned into
Asp718/XhoI digested pBCE1 to generate the antisense transgene
construct pKAS1. The purified NotI insert of pKAS1 is
microinjected into the pronuclei of half day mouse embryos,
alone or in combination with one or more other transgene
constructs, to generate transgenic mice. This construct
expresses an RNA transcript in B-cells that hybridizes with
mouse IgK mRNA, thus down-regulating the expression of mouse
IgK protein as described above for pMAS1.
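
The design of the two antisense primer pairs can be illustrated with a small check that each primer carries the cloning site used for the Asp718/XhoI digestion immediately after a three-nucleotide 5' clamp. The primer sequences are transcribed here as lowercase strings, which is an assumption of the sketch; the dictionary keys and function name are hypothetical labels.

```python
# Primer sequences as transcribed for this sketch (assumption).
PRIMERS = {
    "IgM_first": "cgcggtaccgagagtcagtccttcccaaatgtc",
    "IgM_second": "cgcctcgagacagctggaatgggcacatgcaga",
    "IgK_first": "cgcggtaccgctgatgctgcaccaactgtatcc",
    "IgK_second": "cgcctcgagctaacactcattcctgttgaagct",
}

# Standard recognition sequences (Asp718 shares its site with KpnI).
SITES = {"Asp718": "ggtacc", "XhoI": "ctcgag"}


def five_prime_site(primer: str, clamp: int = 3):
    """Return the name of the site that starts right after the 5' clamp, if any."""
    for name, site in SITES.items():
        if primer.startswith(site, clamp):
            return name
    return None


site_by_primer = {name: five_prime_site(seq) for name, seq in PRIMERS.items()}
for name, site in site_by_primer.items():
    print(f"{name}: {site}")
```

In each pair one primer carries an Asp718 site and the other an XhoI site, so a digested PCR product can enter Asp718/XhoI digested pBCE1 in only one orientation.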

EXAMPLE 19 

This example demonstrates the successful
immunization and immune response in a transgenic mouse of the
present invention.

Immunization of Mice

Keyhole limpet hemocyanin conjugated with greater
than 400 dinitrophenyl groups per molecule (Calbiochem, La
Jolla, California) (KLH-DNP) was alum precipitated according
to a previously published method (Practical Immunology, L.
Hudson and F.C. Hay, Blackwell Scientific (Pubs.), p. 9,
1980). Four hundred µg of alum precipitated KLH-DNP along
with 100 µg dimethyldioctadecyl ammonium bromide in 100 µL of
phosphate buffered saline (PBS) was injected intraperitoneally
into each mouse. Serum samples were collected six days later
by retro-orbital sinus bleeding.

Analysis of Human Antibody Reactivity in Serum

Antibody reactivity and specificity were assessed
using an indirect enzyme-linked immunosorbent assay (ELISA).
Several target antigens were tested to analyze antibody
induction by the immunogen. Keyhole limpet hemocyanin
(Calbiochem) was used to identify reactivity against the
protein component, bovine serum albumin-DNP for reactivity
against the hapten and/or modified amino groups, and KLH-DNP
for reactivity against the total immunogen. Human antibody
binding to antigen was detected by enzyme conjugates specific
for IgM and IgG sub-classes with no cross reactivity to mouse
immunoglobulin. Briefly, PVC microtiter plates were coated
with antigen by drying overnight at 37°C of 5 ng/mL protein in
PBS. Serum samples diluted in PBS, 5% chicken serum, 0.5%
Tween-20 were incubated in the wells for 1 hour at room
temperature, followed by anti-human IgG Fc and IgG F(ab')-
horseradish peroxidase or anti-human IgM Fc-horseradish
peroxidase in the same diluent. After 1 hour at room
temperature enzyme activity was assessed by addition of ABTS
substrate (Sigma, St. Louis, Missouri) and read after 30
minutes at 415-490 nm.



Human Heavy Chain Participates in Immune Response in
Transgenic Mice

Figure 37 illustrates the response of three mouse
littermates to immunization with KLH-DNP. Mouse number 1296
carried the human IgM and IgG unrearranged transgene and was
homozygous for mouse Ig heavy chain knockout. Mouse number
1299 carried the transgene on a non-knockout background, while
mouse 1301 inherited neither of these sets of genes. Mouse
1297, another littermate, carried the human transgene and was
hemizygous with respect to mouse heavy chain knockout. It was
included as a non-immunized control.

The results demonstrate that both human IgG and IgM 
responses were developed to the hapten in the context of 
conjugation to protein. Human IgM also developed to the KLH 
molecule, but no significant levels of human IgG were present 
at this time point. In pre-immunization serum samples from 
the same mice, titers of human antibodies to the same target 
antigens were insignificant. 

EXAMPLE 20 

This example demonstrates the successful 
immunization with a human antigen and immune response in a 
transgenic mouse of the present invention, and provides data 
demonstrating that nonrandom somatic mutation occurs in the
variable region sequences of the human transgene. 

Demonstration of antibody responses comprising human
immunoglobulin heavy chains against a human glycoprotein
antigen

Transgenic mice used for the experiment were
homozygous for functionally disrupted murine immunoglobulin
heavy chain loci produced by introduction of a transgene at
the joining (J) region (supra) resulting in the absence of
functional endogenous (murine) heavy chain production. The
transgenic mice also harbored at least one complete
unrearranged human heavy chain mini-locus transgene (HC1,
supra), which included a single functional VH gene (VH251),
human μ constant region gene, and human γ1 constant region
gene. Transgenic mice shown to express human immunoglobulin
transgene products (supra) were selected for immunization with
a human antigen to demonstrate the capacity of the transgenic
mice to make an immune response against a human antigen
immunization. Three mice of the HC1-26 line and three mice of
the HC1-57 line (supra) were injected with human antigen.

One hundred µg of purified human carcinoembryonic
antigen (CEA) insolubilized on alum was injected in complete
Freund's adjuvant on Day 0, followed by further weekly
injections of alum-precipitated CEA in incomplete Freund's
adjuvant on Days 7, 14, 21, and 28. Serum samples were
collected by retro-orbital bleeding on each day prior to
injection of CEA. Equal volumes of serum were pooled from
each of the three mice in each group for analysis.

Titres of human μ chain-containing immunoglobulin
and human γ chain-containing immunoglobulin which bound to
human CEA immobilized on microtitre wells were determined by
ELISA assay. Results of the ELISA assays for human μ chain-
containing immunoglobulins and human γ chain-containing
immunoglobulins are shown in Figs. 38 and 39, respectively.
Significant human μ chain Ig titres were detected for both
lines by Day 7 and were observed to rise until about Day 21.
For human γ chain Ig, significant titres were delayed, being
evident first for line HC1-57 at Day 14, and later for line
HC1-26 at Day 21. Titres for human γ chain Ig continued to
show an increase over time during the course of the
experiment. The observed human μ chain Ig response, followed
by a plateau, combined with a later developing γ chain
response which continues to rise is characteristic of the
pattern seen with affinity maturation. Analysis of Day 21
samples showed lack of reactivity to an unrelated antigen,
keyhole limpet hemocyanin (KLH), indicating that the antibody
response was directed against CEA in a specific manner.

These data indicate that animals transgenic for
human unrearranged immunoglobulin gene loci: (1) can respond
to a human antigen (e.g., the human glycoprotein, CEA), (2)
can undergo isotype switching ("class switching") as
exemplified by the observed μ to γ class switch, and (3)
exhibit characteristics of affinity maturation in their
humoral immune responses. In general, these data indicate:
(1) the human Ig transgenic mice have the ability to induce
heterologous antibody production in response to a defined
antigen, (2) the capacity of a single transgene heavy chain
variable region to respond to a defined antigen, (3) response
kinetics over a time period typical of primary and secondary
response development, (4) class switching of a transgene-
encoded humoral immune response from IgM to IgG, and (5) the
capacity of transgenic animal to produce human-sequence
antibodies against a human antigen.

Demonstration of somatic mutation in a human heavy chain
transgene minilocus.

Line HC1-57 transgenic mice, containing multiple
copies of the HC1 transgene, were bred with immunoglobulin
heavy chain deletion mice to obtain mice that contain the HC1
transgene and contain disruptions at both alleles of the
endogenous mouse heavy chain (supra). These mice express
human mu and gamma1 heavy chains together with mouse kappa and
lambda light chains (supra). One of these mice was
hyperimmunized against human carcinoembryonic antigen by
repeated intraperitoneal injections over the course of 1.5
months. This mouse was sacrificed and lymphoid cells isolated
from the spleen, inguinal and mesenteric lymph nodes, and
Peyer's patches. The cells were combined and total RNA
isolated. First strand cDNA was synthesized from the RNA and
used as a template for PCR amplification with the following 2
oligonucleotide primers:

149  5'-cta gct cga gtc caa gga gtc tgt gcc gag gtg cag ctg
     (g/a/t/c)-3'

151  5'-ggc gct cga gtt cca cga cac cgt cac cgg ttc-3'

These primers specifically amplify VH251/gamma1 cDNA
sequences. The amplified sequences were digested with XhoI
and cloned into the vector pNN03. DNA sequence from the
inserts of 23 random clones is shown in Fig. 40; sequence
variations from germline sequence are indicated, and dots
indicate sequence identical to germline. Comparison of the cDNA
sequences with the germline sequence of the VH251 transgene
reveals that 3 of the clones are completely unmutated, while
the other 20 clones contain somatic mutations. One of the 3
non-mutated sequences is derived from an out-of-frame VDJ
joint. Observed somatic mutations at specific positions
occur at similar frequencies and in similar distribution
patterns to those observed in human lymphocytes (Cai et al.
(1992) J. Exp. Med. 176:1073, incorporated herein by
reference). The overall frequency of somatic mutations is
approximately 1%; however, the frequency goes up to about 5%
within CDR1, indicating selection for amino acid changes that
affect antigen binding. This demonstrates antigen driven
affinity maturation of the human heavy chain sequences.
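
A mutation frequency of this kind can be computed from an alignment in which dots mark identity to germline. The sketch below is illustrative only: the germline string and clone rows are toy data invented for the example and are not the sequences of Fig. 40, and the function names are hypothetical.

```python
def count_mutations(germline: str, clone_row: str) -> int:
    """Count positions where a clone differs from germline ('.' means identical)."""
    if len(germline) != len(clone_row):
        raise ValueError("row must be aligned to the germline sequence")
    return sum(1 for g, c in zip(germline, clone_row) if c != "." and c != g)


def mutation_frequency(germline: str, rows: list, start: int = 0, end=None) -> float:
    """Mutations per nucleotide over all clones in the region [start, end)."""
    end = len(germline) if end is None else end
    region = germline[start:end]
    mutations = sum(count_mutations(region, row[start:end]) for row in rows)
    return mutations / (len(region) * len(rows))


# Toy data (hypothetical, for illustration only).
toy_germline = "gaggtgcagctggtgcagtc"
toy_rows = [
    "....................",  # unmutated clone
    ".....a..............",  # one substitution
    "..........c...t.....",  # two substitutions
]

overall = mutation_frequency(toy_germline, toy_rows)
region = mutation_frequency(toy_germline, toy_rows, start=10, end=15)
mutated_clone_fraction = 20 / 23  # clone counts stated in the text

print(f"overall: {overall:.3f}, region 10-15: {region:.3f}")
print(f"fraction of clones mutated: {mutated_clone_fraction:.0%}")
```

Comparing the frequency within a region such as CDR1 against the overall frequency is the comparison underlying the 1% versus 5% figures reported above.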

EXAMPLE 21

This example demonstrates the successful formation
of a transgene by co-introduction of two separate
polynucleotides which recombine to form a complete human light
chain minilocus transgene.

Generation of an unrearranged light chain minilocus transgene
by co-injection of two overlapping DNA fragments

1. Isolation of unrearranged functional Vκ gene segments
vk65.3, vk65.5, vk65.8 and vk65.15

The Vκ specific oligonucleotide, oligo-65 (5'-agg
ttc agt ggc agt ggg tct ggg aca gac ttc act ctc acc atc
agc-3'), was used to probe a human placental genomic DNA library
cloned into the phage vector λEMBL3/SP6/T7 (Clontech
Laboratories, Inc., Palo Alto, CA). DNA fragments containing
Vκ segments from positive phage clones were subcloned into
plasmid vectors. Variable gene segments from the resulting
clones were sequenced, and clones that appeared functional were
selected. Criteria for judging functionality include: open
reading frames, intact splice acceptor and donor sequences,
and intact recombination sequence. DNA sequences of 4
functional Vκ gene segments (vk65.3, vk65.5, vk65.8 and
vk65.15) from 4 different plasmid clones isolated by this
procedure are shown in Figs. 41-44. The four plasmid clones,
p65.3f, p65.5g1, p65.8, and p65.15f, are described below.

(1 a) p65.3f

A 3 kb XbaI fragment of phage clone λ65.3 was
subcloned into pUC19 so that the vector derived SalI site was
proximal to the 3' end of the insert and the vector derived
BamHI site 5'. The 3 kb BamHI/SalI insert of this clone was
subcloned into pGP1f to generate p65.3f.

(1 b) p65.5g1

A fragment of phage clone λ65.5 was
subcloned into pGP1f so that the vector derived XhoI site is
proximal to the 5' end of the insert and the vector derived
SalI site 3'. The resulting plasmid is designated p65.5g1.

(1 c) p65.8

A fragment of phage clone λ65.8 was
cloned into pSP72 to generate p65.8.

(1 d) p65.15f

A 10 kb EcoRI fragment of phage clone λ65.16 was
subcloned into pUC18 to generate the plasmid p65.15.3. The Vκ
gene segment within the plasmid insert was mapped to a 4.6 kb
EcoRI/HindIII subfragment, which was cloned into pGP1f. The
resulting clone, p65.15f, has unique XhoI and SalI sites
located at the respective 5' and 3' ends of the insert.



2. pKV4

The XhoI/SalI insert of p65.8 was cloned into the
XhoI site of p65.15f to generate the plasmid pKV2. The
XhoI/SalI insert of p65.5g1 was cloned into the XhoI site of
pKV2 to generate pKV3. The XhoI/SalI insert of pKV3 was
cloned into the XhoI site of p65.3f to generate the plasmid
pKV4. This plasmid contains a single 21 kb XhoI/SalI insert
that includes 4 functional Vκ gene segments. The entire
insert can also be excised with NotI.
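
The stepwise assembly relies on XhoI and SalI producing compatible ends. The following sketch illustrates why a XhoI/SalI fragment placed into a XhoI site leaves one XhoI site at the 5' junction. It assumes the standard cut positions C^TCGAG (XhoI) and G^TCGAC (SalI) and an insert ligated in the 5' XhoI to 3' SalI orientation; the names are hypothetical.

```python
XHOI = "CTCGAG"  # cuts C^TCGAG
SALI = "GTCGAC"  # cuts G^TCGAC


def cut(site: str) -> tuple:
    """Split a recognition site into the halves left on each side of the cut."""
    return site[:1], site[1:]


xho_left, xho_right = cut(XHOI)  # "C", "TCGAG"
sal_left, sal_right = cut(SALI)  # "G", "TCGAC"

# Both enzymes leave the same 4-nt 5' overhang, so the ends can be ligated.
compatible = xho_right[:4] == sal_right[:4]

# 5' junction: vector XhoI left half + insert XhoI right half.
five_prime_junction = xho_left + xho_right
# 3' junction: insert SalI left half + vector XhoI right half.
three_prime_junction = sal_left + xho_right

print("compatible overhangs:", compatible)
print("5' junction:", five_prime_junction, "(XhoI)" if five_prime_junction == XHOI else "")
print("3' junction:", three_prime_junction,
      "(hybrid, cut by neither)" if three_prime_junction not in (XHOI, SALI) else "")
```

Because the hybrid 3' junction is cut by neither enzyme, each round can leave a XhoI site at the 5' end available for the next insertion, which is one way to read the pKV2, pKV3, pKV4 series.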

3. pKC1B
(3 a) pKcor

Two XhoI fragments derived from human genomic DNA
phage λ clones were subcloned into plasmid vectors. The
first, a 13 kb Jκ2-Jκ5/Cκ containing fragment, was treated with
Klenow enzyme and cloned into HindIII digested, Klenow
treated, plasmid pGP1d. A plasmid clone (pK-31) was selected
such that the 5' end of the insert is adjacent to the vector
derived ClaI site. The second XhoI fragment, a 7.4 kb piece
of DNA containing Jκ1, was cloned into XhoI/SalI-digested
pSP72, such that the 3' insert XhoI site was destroyed by
ligation to the vector SalI site. The resulting clone,
p36.2s, includes an insert derived ClaI site 4.5 kb upstream
of Jκ1 and a polylinker derived ClaI site downstream in place
of the naturally occurring XhoI site between Jκ1 and Jκ2. This
clone was digested with ClaI to release a 4.7 kb fragment
which was cloned into ClaI digested pK-31 in the correct 5' to
3' orientation to generate a plasmid containing all 5 human Jκ
segments, the human intronic enhancer, human Cκ, 4.5 kb of 5'
flanking sequence, and 9 kb of 3' flanking sequence. This
plasmid, pKcor, includes unique flanking XhoI and SalI sites
on the respective 5' and 3' sides of the insert.

(3 b) pKcorB

A 4 kb BamHI fragment containing the human 3' kappa
enhancer (Judde, J.-G. and Max, E.E. (1992) Mol. Cell. Biol.
12:5206, incorporated herein by reference) was cloned into
pGP1f such that the 5' end is proximal to the vector XhoI
site. The resulting plasmid, p24Bf, was cut with XhoI and the
17.7 kb XhoI/SalI fragment of pKcor cloned into it in the same
orientation as the enhancer fragment. The resulting plasmid,
pKcorB, includes unique XhoI and SalI sites at the 5' and 3'
ends of the insert respectively.

(3 c) pKC1B

The XhoI/SalI insert of pKcorB was cloned into the
SalI site of p65.3f to generate the light-chain minilocus-
transgene plasmid pKC1B. This plasmid includes a single
functional human Vκ segment, all 5 human Jκ segments, the human
intronic enhancer, human Cκ, and the human 3' kappa enhancer.
The entire 25 kb insert can be isolated by NotI digestion.



4. Co4

The two NotI inserts from plasmids pKV4 and pKC1B
were mixed at a concentration of 2.5 ng/µl each in
microinjection buffer, and co-injected into the pronuclei of
half day mouse embryos as described in previous examples.
Resulting transgenic animals contain transgene inserts
(designated Co4, product of the recombination shown in Fig.
45) in which the two fragments co-integrated. The 3' 3 kb of
the pKV4 insert and the 5' 3 kb of the pKC1B insert are
identical. Some of the integration events will represent
homologous recombinations between the two fragments over the 3
kb of shared sequence. The Co4 locus will direct the
expression of a repertoire of human sequence light chains in a
transgenic mouse.
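
The joining of two fragments through a shared end sequence can be sketched as a suffix/prefix merge. This is a simplified model only: it assumes a single recombination within the shared region, treats the stated insert sizes (21 kb, 25 kb, 3 kb overlap) as exact, and uses short toy strings in place of real sequence. The function names are hypothetical.

```python
def merge_by_overlap(left: str, right: str, min_overlap: int) -> str:
    """Join two sequences through the longest shared suffix/prefix."""
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("no shared end sequence of the required length")


def merged_length_kb(left_kb: float, right_kb: float, overlap_kb: float) -> float:
    """Length of the recombined product when the overlap is counted once."""
    return left_kb + right_kb - overlap_kb


# Toy fragments (hypothetical): the last 4 nt of the first equal the first 4 nt
# of the second, standing in for the 3 kb of shared sequence.
toy_v_fragment = "AAAACCCCGGGT"
toy_jc_fragment = "GGGTTTTTACGT"
toy_locus = merge_by_overlap(toy_v_fragment, toy_jc_fragment, min_overlap=4)

co4_estimate_kb = merged_length_kb(21, 25, 3)

print(toy_locus)
print(f"estimated recombined locus: {co4_estimate_kb} kb")
```

On these assumptions a single homologous recombination between the two inserts would give a locus of about 43 kb per recombined copy.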

The foregoing description of the preferred 

embodiments of the present invention has been presented for 
25 purposes of illustration and description. They are not 

intended to be exhaustive or to limit the invention to the 

precise form disclosed, and many modifications and variations 

are possible in light of the above teaching. 

All publications and patent applications herein are 
30 incorporated by reference to the same extent as if each 

individual publication or.patent application was specifically 

and individually indicated to be incorporated by reference. 

Although the present invention has been described in some
detail by way of illustration for purposes of clarity of
understanding, it will be apparent that certain changes and
modifications may be practiced within the scope of the claims.

WHAT IS CLAIMED IS:

TRANSGENE CLAIMS
1. An isolated immunoglobulin heavy chain transgene that is
expressed in B cells of a transgenic nonhuman animal containing
at least one integrated copy of a polynucleotide comprising a
DNA sequence of the formula:

(VH)x-(D)y-(JH)z-(SD)m-(C1)n-[(T)-(SA)p-(C2)]q

wherein x, y, z, m, n, p, and q are integers and x is 2-100,
n is 1-10, y is 2-50, p is 1-10, z is 1-50, q is 0-50, and m
is 0-10.
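
As a reading aid, the integer limits recited for the formula can be captured in a small check. The sketch below is illustrative only: the class name `TransgeneFormula`, its method names, and the example values are hypothetical, and the code checks nothing beyond the numerical ranges recited above.

```python
from dataclasses import dataclass

# Inclusive ranges recited for each subscript of the formula.
RANGES = {
    "x": (2, 100),  # subscript of (VH)
    "y": (2, 50),   # subscript of (D)
    "z": (1, 50),   # subscript of (JH)
    "m": (0, 10),   # subscript of (SD)
    "n": (1, 10),   # subscript of (C1)
    "p": (1, 10),   # subscript of (SA)
    "q": (0, 50),   # subscript of the bracketed [(T)-(SA)p-(C2)] unit
}


@dataclass(frozen=True)
class TransgeneFormula:
    x: int
    y: int
    z: int
    m: int
    n: int
    p: int
    q: int

    def out_of_range(self) -> list:
        """Return the names of subscripts that fall outside the recited ranges."""
        bad = []
        for name, (low, high) in RANGES.items():
            value = getattr(self, name)
            if not low <= value <= high:
                bad.append(name)
        return bad

    def render(self) -> str:
        """Write the formula with the integer subscripts filled in."""
        return (
            f"(VH){self.x}-(D){self.y}-(JH){self.z}-(SD){self.m}-"
            f"(C1){self.n}-[(T)-(SA){self.p}-(C2)]{self.q}"
        )


# Hypothetical example values, chosen only to exercise the check.
example = TransgeneFormula(x=2, y=10, z=6, m=1, n=1, p=1, q=1)
print(example.render())
print(example.out_of_range())
```

A construct with, say, a single VH segment (x=1) would be reported as out of range for `x`, since the recited lower limit is 2.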

2. A transgene of Claim 1, wherein said polynucleotide
comprises at least one heterologous D gene segment that can be
incorporated into a functionally rearranged V-D-J sequence.

3. A transgene of Claim 2, wherein said heterologous D gene
segment contains at least one human D gene.

4. A transgene of Claim 1, wherein said polynucleotide
comprises a human μ CH gene segment and a human γ1 CH gene
segment.

5. A transgene of Claim 1, wherein q is at least 1, m is at
least 1, n is at least 1, and said polynucleotide comprises at
least about 50 basepairs of a segment immediately upstream of a
germline switch sequence.

6. A transgene of Claim 5, wherein said polynucleotide
comprises about 200 basepairs of sequence immediately upstream
of a human germline γ1 switch sequence.

7. A transgene of Claim 1, wherein a SD segment is a γ1 switch
sequence.

8. A transgene of Claim 1, wherein said [...] upstream of a
human germline γ1 switch sequence [...] said 200 basepairs are
operably linked to a human [...] switch sequence.

9. A transgene of Claim 1, wherein said D region comprises
only heterologous D genes.

10. A transgene of Claim 9, wherein said D region comprises
only human D genes.

11. A transgene of Claim 1, wherein the polynucleotide is
functionally rearranged in vivo to produce a rearranged V-D-J
gene segment that contains a recognizable D region gene
sequence.

12. A transgenic non-human animal comprising a 
transgene of claim 1 in the germline of said non-human animal. 

13. A transgenic non-human animal of Claim 12, wherein said
transgene is rearranged.

14. A transgenic non-human animal of Claim 12, wherein said
transgene is unrearranged.

15. A transgenic non-human animal of Claim 14, wherein said B
cells produce a heterologous antibody.

16. A transgenic non-human animal of Claim 15, wherein said B
cells produce a population of heterologous antibodies of more
than one isotype.

17. A transgenic non-human animal of Claim 12, wherein said
transgene encodes VH, D, JH, and CH regions.

18. A transgenic non-human animal of Claim 12, wherein said
non-human animal is a rodent.

19. A transgenic non-human animal of Claim 12, wherein serum
of said animal comprises human antibodies that contain a
recognizable D region gene sequence.

20. A transgenic non-human animal of Claim 12, wherein at
least one lymphocyte of said animal contains a mRNA encoding a
heterologous immunoglobulin chain.

21. A transgenic animal of claim 20, wherein said mRNA
contains a recognizable D region gene sequence.

22. A transgenic animal of claim 21, wherein said 
mRNA contains a functionally rearranged V-D-J sequence. 

23. A transgenic animal of claim 12, wherein said transgenic
animal comprises heterologous antibodies which comprise a
human-sequence μ chain and which specifically bind to an
antigen.

24. A transgenic animal of claim 23, further comprising
heterologous antibodies which comprise a human-sequence γ chain
and which specifically bind to an antigen.

25. A transgenic animal of claim 24, wherein the antigen is a
human antigen.

26. A transgenic animal of claim 25, wherein the 
human antigen is CEA or a human blood cell antigen. 

27. A transgenic nonhuman animal comprising a human transgene
of claim 1, wherein the transgenic animal further comprises
lymphoid tissue containing a population of mRNA species having
somatic mutations clustered in CDR regions of a variable region
encoded by an immunoglobulin transgene.

28. A transgenic nonhuman animal of claim 27, wherein the
immunoglobulin transgene is a heavy chain minilocus
corresponding to HC1.

29. A hybridoma comprising a transgenic non-human B cell fused
with a second cell capable of immortalizing said B cell,
wherein said hybridoma produces a monoclonal antibody
heterologous to said non-human animal.

30. A hybridoma of Claim 29 wherein the 
heterologous antibody comprises a human heavy chain containing 
a recognizable D region gene sequence. 

31. A hybridoma of Claim 29 wherein said B cell is 
of murine origin. 

32. A hybridoma of Claim 29 wherein the monoclonal antibody
binds to a human antigen with an affinity of at least
1 × 10⁷ M⁻¹.

33. A human monoclonal antibody produced by a 
hybridoma of Claim 29. 

34. A transgenic non-human animal having serum comprising
detectable heterologous antibodies and having at least one
suppressed endogenous immunoglobulin locus.

35. A method for producing heterologous 
immunoglobulins from a transgenic nonhuman animal, the animal 
having a genome comprising germline copies of at least one 
transgene of Claim 1, the method comprising: 

suppressing an endogenous immunoglobulin locus; 
contacting the animal with a preselected antigen; and
collecting said heterologous immunoglobulins.

36. A method according to Claim 35, wherein 
suppression is produced by an antisense polynucleotide. 

37. A method according to Claim 36, wherein the antisense
polynucleotide is transcribed from an integrated antisense
transgene.

38. A method according to Claim 37, wherein transcription of
the antisense transgene produces a transcript containing an
antisense sequence linked to a second sequence.

39. A method according to Claim 37, wherein the non-human
animal has a genome comprising germline copies of at least one
light chain immunoglobulin transgene.

40. A method for suppressing at least one endogenous
immunoglobulin locus in a transgenic non-human animal,
comprising the steps of:

introducing an antisense transgene into a non-human animal to
produce a non-human transgenic animal bearing an antisense
transgene;
transcribing antisense RNA from the antisense transgene in
vivo;
hybridizing the antisense RNA to a polynucleotide containing
an endogenous immunoglobulin sequence; and
inhibiting expression of an endogenous immunoglobulin chain.

41. A method of Claim 40, wherein said antisense transgene
contains a nucleotide sequence that is homologous to an
endogenous kappa chain immunoglobulin gene sequence.

42. A method of Claim 40, wherein said antisense transgene
contains a nucleotide sequence that is homologous to an
endogenous heavy chain immunoglobulin gene sequence.

43. A method for inactivating an endogenous immunoglobulin
gene, comprising the steps of:

integrating a targeting vector into an endogenous
immunoglobulin gene; and
selecting for a cell bearing an integrated targeting vector.



44. A method according to Claim 43, further comprising the
step of:

generating a line of non-human animals bearing a copy of the
integrated targeting vector.

45. A method according to Claim 43, wherein the endogenous
immunoglobulin gene is a light chain gene.

46. A method according to Claim 45, wherein the 
light chain gene is a kappa light chain gene. 

47. A method according to Claim 43, wherein the endogenous
immunoglobulin gene is a heavy chain gene.

48. A method according to Claim 43, wherein the endogenous
immunoglobulin gene is a murine immunoglobulin gene.

49. A method according to Claim 44, wherein the 
non-human animal is a mouse. 

50. A method according to Claim 49, wherein the mouse further
comprises a human immunoglobulin transgene of Claim 1.

51. A method according to Claim 43, wherein the cell is an
embryonic stem cell.

52. A method of generating a non-human animal having an
inactivated endogenous immunoglobulin gene, comprising the
steps of:

breeding a line of non-human animals generated by the method of
Claim 44; and
identifying individual non-human animal offspring that are
homozygous for an inactivated immunoglobulin gene.

53. A method according to Claim 52, wherein said offspring
have an inactivated heavy chain immunoglobulin gene and at
least one inactivated light chain immunoglobulin gene.

54. A transgenic non-human animal with an inactivated
endogenous immunoglobulin gene.

55. A non-human animal of Claim 54, wherein an integrated
targeting vector is present in germline DNA.

56. A non-human animal of Claim 55, wherein a light 
chain immunoglobulin gene is inactivated. 

57. A non-human animal of Claim 55, wherein a heavy chain
immunoglobulin gene is inactivated.

58. A non-human animal of Claim 55 which is homozygous for at
least one inactivated endogenous immunoglobulin gene.

59. An antisense transgene comprising a nucleotide sequence
that is complementary to a polynucleotide sequence that is
substantially identical to an immunoglobulin gene sequence.

60. A transgenic non-human animal bearing an 
antisense transgene of Claim 59. 

61. A transgenic non-human animal comprising a functionally
disrupted endogenous heavy chain locus and a heterologous
immunoglobulin heavy chain transgene, wherein said animal makes
an antibody response following immunization with an antigen.

62. A transgenic non-human animal of Claim 61, wherein said
functionally disrupted endogenous heavy chain locus is a JH
region homologous recombination knockout, said heterologous
immunoglobulin heavy chain transgene is the HC1 human minigene
transgene, and said antigen is a human antigen.



63. A transgenic non-human animal of Claim 61, wherein the
antibody response comprises a population of antibodies which
comprise human μ chain-containing immunoglobulins and human γ
chain-containing immunoglobulins.

64. A transgenic non-human animal of Claim 63, wherein the
heterologous antibodies comprise a population of heterologous
immunoglobulins which comprise somatic mutation in the variable
regions which cluster in the CDRs.

65. A transgenic non-human animal of Claim 63, wherein the
antigen is selected from the group consisting of human blood
cell surface antigens, KLH, and human CEA.

TTCGATCTCTGGGGCCGTGGCACCCTGGTCACTGTCTCCTCAG
FDLWGRGTLVTVSS

TTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG
FDIWGQGTMVTVSS
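
The amino acid line under each nucleotide line can be checked by translating the nucleotides with the standard genetic code. The following is a minimal sketch, assuming the reading frame starts at the first nucleotide and that the trailing partial codon (the final G) is ignored. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, dropping any incomplete final codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


seq_1 = "TTCGATCTCTGGGGCCGTGGCACCCTGGTCACTGTCTCCTCAG"
seq_2 = "TTTGATATCTGGGGCCAAGGGACAATGGTCACCGTCTCTTCAG"
print(translate(seq_1))  # FDLWGRGTLVTVSS
print(translate(seq_2))  # FDIWGQGTMVTVSS
```

Both sequences are 43 nucleotides long, so each yields 14 residues with one nucleotide left over.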
TTTTCIG GOC TGACAACCAG GGTGGCGCAG GATGCTCAGT GCAGAGAGGA 50
AGAAGCAGGT GGTCTCTGCA GCTGGAAGCT CAGCTOOCAC CCAGCTGCTT 100 



TGCATCTOCC TXXX2GCTGC CCTACCTTCC AGAGOCCfelA, TTCWlftrSG 150 

TCICAGAGCC CIGGGGAGGA ACTGCTCAGT TAGGACCCAG AGGGAACCAT 200 

Ma 

GGAAGCCCCA GCTCAGCTTC TCTTCCTCCT GCTACTCTGG CTOCCAGgtg 250 
tGluAlaPro AlaGlriLeuL euPheLeuLe uLeuLeuTrp teuPrb 
agggggaacc atgaggtggt tttgcacatt agtgaaaact cttgccacct 300 

ctgctcagca agaaatataa ttaaaattca aagtatatca acaattttgg 350 

ctctactcaa agacagttgg tttgatcttg attacatgag tgcatttctg 400 

ttttatttcc aatttcagAT ADCACCGGAG AAA1TGTGTT GACACAGTCT 450 

Asp ThrThrGlyG luIleValLe uBirGlnSer 
OCAGCCACCC TGTCTTTGTC TOCAGGGGAA AGAGOCAQCC TCTCCTGCAG 500 
ProAlalEhrL euSerLeuSe xProGlyGlu ArgAlaThrL euSerCysAr 
GGCCAGTCAG AGTGTTAGCA GCTACTTAGC CTGGI&OCAA. CACTAACCTG 550 
gAlaSerGln SerValSerS erTyrLeuAl aTrpTyrGln GlnLysProG 
GOCAGGCTOC CAGGCTOCTC ATCTATGATG CATCCAACAG GGOCACTGGC 600 
lyGlnAlaPr oArgLeuLeu IleTyrAspA laSerAsnAr gAlaThrGly 
ATOCCAGOCA GGTTCAGTQG CAGTGGGTCT GGGACAGACT TCACTCTCAC 650 
IleProAlaA rgPheSerGl ySerGlySer GlyThrAspP heThrLeuTh 
CATCAGCAGC CTAGAGOCTG AAGATTTTGC AGTTTAITAC TGTCAGCAGC 700 
rlleSerSer LeuGluProG luAspPheAl aValTvrTvr CysGlnGlnA 
^r^var^ rmrTt-Ara ciTfkrvcrzr. AT^flfrraa* aanrt-raAra 750 

rgSerAsnTr pPro 

AGACCATCAG TGTTTACIAG ATTATHAIAC CAGCTGCTTC CTTTACAGAC 800 

AGCTAGTGGG GT 812

AGAIGCTCAG TOCAGaGftGA AGAftACAGGT GGTCTCTGCA. 50

GCTGGAAGCT CAGCICOCAC COTGCTGCT TTGCATGTOC CTOCCAGCTG 100 

OCTKCTTC OO^XcElSSSIgCCT GGGICAGftGC TCTGGGGAGS 150 

AftCTCCTCAG TT&3GAG0CA. GAOQGAAOCA TQGAAGOCCC AGOGCAGCTr 200 

M etGluAlaPr oAlaGlnLeu 

CTLT1XXTCC TGCTa CICIG GCTCACAGgt gaggggaata tgaggtgtct 250 
LeuPheLeuL euLeuLeuTr pLeuThr 

ttgcacatca gtgaaaactc ctgccacctc tgctcagcaa gaaatataat 300 

taaaattcaa aatagatcaa caattttggc tctactcaaa gacagtgggt 350 

ttgattttga ttacatgagt gcatttctgt tttatttcca atttcagAIA 400 

- AspT 
(XIROOGGAGA AATTGTGTTG JOI3G1CIC CAGCCAGGCT GTCTTTGTCT 450 
hrThiGlyGl uIle ValLe u ThrGlnSerP roAlaThrDs uSerl^uSer 
0C3GGGG&AA Q^GCCACCCT CTCCTGCAGG GCX3GTCAGG CTGTIftGCaG 500 
ProGlyGluA rgAlaThrliS uSexCysArg AlaSerGLnG lyValSecSe 
CTACTIftGCC TGSTftOCaGC AGAAACCTGG CCAGGCICCC AGGCT0CTCA 550 
rTyrLeuAla TriflyrGlnG InLysProGI yGlnAlaPro ArgLeuLeuI 
TCTATGATQC ATCCAACAGG GOCACTQGCA. TCOCAGOC^G GTTCAGTGGC 600 
leTyrAspAl aSerftsnArg AlaThrGlyl leProAlaAr gPheSerGly 
ftGIGGQCCIG GSOGACTT CACICTCACC ATCAGCAGCC TAGAGCCTGA. 650 
SerGlyProG lyThr&spPh eThrLeuThr IleSerSerL euGluProGl 
AGKTTT1GCA G3M3CT GTC&GCftGCG XAQCAACIGG CMXTtaGBd 700 
uAspPheAla Val TvrTvrC ysGlnGlnAr gSerftsnTrp His 
RHaTrrrACR TqflforaRaa RfYtT CAACft& GAOCATCAGT GTTTACTAGA. 750

TTATEAIACC AQCTQCTTCC TTTAC&GZO. GCIAGTGGGG TGGCCACTCA. 800

CTGTIAGC^T L-IX^GCTCTR. TTTQGCCRTT TTGGAGTTCA, AGTTGTCAAG 850

TOCftRAfiTTA. CTS&TGTTft G TOCKFTGCRT CRIfiOCRTTT CAGTGTGGCT 900

CCGCCOCAGC TCCHTCCAT GTCCCTCOCA GOOGCGCTQC: AGTCCAGAGC 50

rrbmvTraAH GCCTGGGTCA. GAGCTCTGGA GAAGAGCTGC TCAGTTAGGA 100 

ACOCCAGAGG GAACCA2GGA AAOCCCAGOG CAGCTTCTCT TCCTCCTGCT 150 

M=tGl ulhrProAla GlnLeuLeuP heLeuleuI^ 

AC1CTGGCTC CCAGgtgagg ggaacatggg atggttttgc atgtcagtga 200 
uLeuTrpIfiu Pro 

aaaccctctc aagtcctgtt acctggcaac tctgctcagt caatacaata 250 

attaaagctc aatataaagc aataattctg gctcttctgg gaagacaatg 300 

ggtttgattt agattacatg ggtgactttt ctgttttatt tccaatctca 350 

gATAOCACOG GAGAAA1TGT GTTGACGCAG TCTCCAGGCA CGCTGTCTTT 400 
AscflhrThrG lyGluIleVa ILeuThrGln SerPrcGlyT hrLeuSerla 
GTC1CCAGGG GAAAGAGCCA CCCICTCCTG CAQGGOCAGT CAGAGTGTTA 450 
uSerProGly GluArgAlaT hrLeuSerCy sftrgAlaSer GlnSerValS 
GCBGCAGCIA. CnAGOCTGG TACCAGCAGA AACCTGQCCA GU-IUXAGG 500 
erSerSerTy rLeuAlalrp TyrGlnGlnL ysProGlyGl nAlaProArg 

c iu^JO fl s r . aaxasrocftTC cagcagggcc actggcatcc caGacaGGTT 550 

LeuLeuIleT yrGlyAlaSe rSerArgAla ThrGlylleP roAspArgPh 
CAGTGGCAGT gggtctggga CaGACTICAC TCTCAOCATC AGCAGACTGG 600 
eSerGlySer GlySerGlyT hrAspPheTh rLeuThrlle SerArgl^uG 
AGOCTGAAGA TTTTGCAGTG TATTACTGTC AGCAGTATGG TAGCTCACCT 650 
luProGluAs pPheAlaVal TvrTvrCvsG lnGlnTyrGl ySerS erPro 
CX^Ha TTCAGCTTGA akraftAAarrl TCTGCAAGAC CTTCATTGTT 700 

TACTAGA1TA TAOCAGCTGC TTOCTTTACA GATAGCTGCT GCAATGACAA 750 



CTCAA3TTAG CATCTCTCTC TGCTTGGGCA TTTTGGGGAT CTTAAAAAAG 800
TAAIG0CTTG AJAIA lTm ' GACTCTGATT CCTGCAITTTT TCCTCAGACC 850

AAGAIGGACA G0CAGGTTTA AGGACAGTTT CACAGTAATG GCCACTGGAT 900

AAACACATTC TCTGCAGACA. AATITGAGCT AarTTGATCT TACCTGGACA 50

GGTQGGGACA CTGAGCTGGT GCTGAGTTAC TCAGATGCGC CAGCTCTGCA 100 

GCTCK3CCCA GCCTGGOCCA. TOCOCTGCTC AITTGGATGT TCTCAGAGCA 150 

CAAJXTCCTC CCCTGAAGCC Bm&G GCTGGTCAGA. CTTTCTGCAG 200 

GAAICAGACC CAGTCAGGAC ACBGCAIGGA CAIGAGGGTC CTCGCICAGC 250 

MstAs f^fetArgVal LeuAlaGlnL 

TOCTQGGQCT CCTGCTGCTC TGTTTCCCAG gtaaggatgg agaacactag 300 
euLeuGlyLe uLeuLeuLsu CysPhePro 

cagtttactc agcccagggt gctcagtact gctttactat tcagggaaat 350 

tctcttacaa catgattaat tgtgtggaca tttgttttta tgtttccaat 400 

CtcagGTGCC AGAIGTGACA TOCftGATGRC CCAGTCTCCR. TOCTCACICT 450 

GlyAla ArgCysAspI leGlnMetTh rGlnSerPro SerSerieuS 
CH^CAICTOT AGGAGACAGA. CTTGTCGGGC GAGTCAGGGT 500 

erftlaSeiVa lGlyAspAxg ValThrlleT hrCysArgAl aSerGlnGly 
AIIAGCAGCT GGITAGOCTG GTATCAGCAG AAACCAGAGA. AAGCCOCTAA 550 
IleSerSerT rpLeuAlaTr pflyrGlnGI n LysProGluL ysAlaProLy 

gtccctgatc tatgctgcat co^gtttgca aagtqgggtc ogatcaaggt 600 

sSerLeuIle TyrAlaAlaS erSerLeuGl nSerGlyVal ProSerArgP 
TCAGCGGCAG TGGAICTQGG ACAGATTTCA CTCTC^CCAT CAGCAGCCTG 650 
heSerGlySe rGlySerGly ThrAspPheT hrLeuThrxl eSerSerLeu 
CAGCCTGftAG ATTTIGCAAC TTATTACTGC C&ACAGTAIA ATAGTTACCC 700 
Gi riProGluA. spPheAlaTh r TyrTyrCys G lnGlnTyrA. snSerTyrPr 
arrt-araf?tyt TTACACACCC aahrATAAAr rtYrararaA GCAGATGTGT 750 

o 

GAGGCTGGGC TQ00QCAGCT GCTTCTCCTG ATGCCTCCAT CAGCTGAGAG 800 
TGTTCCTCAG ATGCAGCCAC ACTCTGATGG TGTTGGTAGA TGGGGAC 847